Forms, chains, and Stokes' theorem

Only by taking infinitesimally small units for observation (the differential of history, that is, the individual tendencies of men) and attaining to the art of integrating them (that is, finding the sum of these infinitesimals) can we hope to arrive at the laws of history.
Leo Tolstoy (1828-1910) War and Peace

38.1 Integration

This chapter is going to focus on integrals, which were originally formulated in calculus as a type of sum. In fact, the integral symbol int\int, introduced by Leibniz, is a 'long s' to indicate the Latin word summa (or fumma in old-fashioned writing) meaning sum. The fundamental theorem of calculus states that differentiation and integration are the inverses of each other, so that
(38.1) a b d f d x d x = f ( b ) f ( a ) . (38.1) a b d f d x d x = f ( b ) f ( a ) . {:(38.1)int_(a)^(b)((d)f)/((d)x)dx=f(b)-f(a).:}\begin{equation*} \int_{a}^{b} \frac{\mathrm{~d} f}{\mathrm{~d} x} \mathrm{~d} x=f(b)-f(a) . \tag{38.1} \end{equation*}(38.1)ab df dx dx=f(b)f(a).
This equation has a rather intriguing feature: we are integrating the derivative d f / d x d f / d x df//dx\mathrm{d} f / \mathrm{d} xdf/dx over a region ( a x b ) ( a x b ) (a <= x <= b)(a \leq x \leq b)(axb) and the answer depends only on the function f f fff evaluated at the boundary of the region. Moreover, there is an interesting property of the signs on the right-hand side of eqn 38.1: f ( b ) f ( b ) f(b)f(b)f(b) is included with a + sign + sign +sign+\operatorname{sign}+sign, but f ( a ) f ( a ) f(a)f(a)f(a) is included with a sign. You can see where this comes from by thinking of the integral as a sum, slicing up the interval, so that we could write
(38.2) a b d f d x d x Δ x i = 1 N 1 f ( x i + 1 ) f ( x i ) Δ x (38.2) a b d f d x d x Δ x i = 1 N 1 f x i + 1 f x i Δ x {:(38.2)int_(a)^(b)((d)f)/((d)x)dx~~Delta xsum_(i=1)^(N-1)(f(x_(i+1))-f(x_(i)))/(Delta x):}\begin{equation*} \int_{a}^{b} \frac{\mathrm{~d} f}{\mathrm{~d} x} \mathrm{~d} x \approx \Delta x \sum_{i=1}^{N-1} \frac{f\left(x_{i+1}\right)-f\left(x_{i}\right)}{\Delta x} \tag{38.2} \end{equation*}(38.2)ab df dx dxΔxi=1N1f(xi+1)f(xi)Δx
where Δ x Δ x Delta x\Delta xΔx is the width of each slice, so that adjacent terms cancel and all that is left is f ( x N ) f ( x 1 ) f ( b ) f ( a ) f x N f x 1 f ( b ) f ( a ) f(x_(N))-f(x_(1))-=f(b)-f(a)f\left(x_{N}\right)-f\left(x_{1}\right) \equiv f(b)-f(a)f(xN)f(x1)f(b)f(a), the difference between the value of f f fff at the boundary of the region of integration.
This result can be generalized to three dimensions, so that 1 1 ^(1){ }^{1}1
(38.3) f d V = f d S (38.3) f d V = f d S {:(38.3)int vec(grad)fdV=int fd vec(S):}\begin{equation*} \int \vec{\nabla} f \mathrm{~d} V=\int f \mathrm{~d} \vec{S} \tag{38.3} \end{equation*}(38.3)f dV=f dS

38.1 Integration
38.2 Integrating over forms 400 38.3 Anatomy of an integral 401 38.4 Boundaries and chains 404 38.5 Stokes' theorem 405 Chapter summary 408 Exercises 408
\curvearrowright This Chapter represents the logical endpoint for the material in Part V V VVV of the book. However, it lies outside most of our remaining topics, so can be skipped on a first reading. Stokes' theorem is used in Chapter 43.
1 1 ^(1){ }^{1}1 In components, eqn 38.3 becomes
f x i d V = f d S i f x i d V = f d S i int(del f)/(delx^(i))dV=int fdS^(i)\int \frac{\partial f}{\partial x^{i}} \mathrm{~d} V=\int f \mathrm{~d} S^{i}fxi dV=f dSi
2 2 ^(2){ }^{2}2 Sir George G. S. Stokes, 1st Baronet (1919-1903) made numerous, important contributions to mathematical physics in the 19th century. He was the first person to simultaneously be Lu casian Professor of Mathematics, President of the Royal Society, and Member of Parliament for Cambridge University. Newton had also held all three posts, but not at the same time.
where the equivalent of the sign change seen on the right-hand side of eqn 38.1 is accomplished by the surface vector d S d S d vec(S)\mathrm{d} \vec{S}dS changing orientation on either side of the surface. Then putting f = a j f = a j f=a^(j)f=a^{j}f=aj (where j = x , y j = x , y j=x,yj=x, yj=x,y or z z zzz ) into eqn 38.3 and then summing over j j jjj yields
(38.4) a d V = a d S , (38.4) a d V = a d S , {:(38.4)int vec(grad)* vec(a)dV=int vec(a)*d vec(S)",":}\begin{equation*} \int \vec{\nabla} \cdot \vec{a} \mathrm{~d} V=\int \vec{a} \cdot \mathrm{~d} \vec{S}, \tag{38.4} \end{equation*}(38.4)a dV=a dS,
which is the divergence theorem, familiar from vector calculus (and used extensively in electromagnetism and fluid dynamics). This famous result preserves the intuition we have gained from eqn 38.1, namely that the integral of the derivative of a quantity in the interior of a space is equal to some kind of sum of that quantity on the boundary of that space.
We leapt from one dimension to three dimensions. What about the result in the middle? This is a little more complicated and a twodimensional version of this argument yields the result that
(38.5) f × d S = f d , (38.5) f × d S = f d , {:(38.5)int vec(grad)f xxd vec(S)=-ointfd vec(ℓ)",":}\begin{equation*} \int \vec{\nabla} f \times \mathrm{d} \vec{S}=-\oint f \mathrm{~d} \vec{\ell}, \tag{38.5} \end{equation*}(38.5)f×dS=f d,
where the right-hand side is an integral round a contour. This equation follows from the idea that all the gradients of f f fff inside the surface sum to nothing, leaving only a circulating term around the boundary. The i i iii th component of this equation can be written out as ε i k m k f d S m = ε i k m k f d S m = epsi_(ikm)intdel^(k)fdS^(m)=\varepsilon_{i k m} \int \partial^{k} f \mathrm{~d} S^{m}=εikmkf dSm= f d i f d i -ointfdℓ_(i)-\oint f \mathrm{~d} \ell_{i}f di, so that substituting in f = a j f = a j f=a^(j)f=a^{j}f=aj and then setting i = j i = j i=ji=ji=j yields Stokes' theorem 2 2 ^(2){ }^{2}2
(38.6) × a d S = a d (38.6) × a d S = a d {:(38.6)int vec(grad)xx vec(a)*d vec(S)=oint vec(a)*d vec(ℓ):}\begin{equation*} \int \vec{\nabla} \times \vec{a} \cdot \mathrm{~d} \vec{S}=\oint \vec{a} \cdot \mathrm{~d} \vec{\ell} \tag{38.6} \end{equation*}(38.6)×a dS=a d
This equation is also familiar from vector calculus, electromagnetism, and fluid mechanics, but for the purposes of this chapter it is important to notice something about the structure of this equation. It is complicated by the vector product symbol, but once again we have emerged with the intuition that an integral of some kind of gradient of a function inside the interior of a space is related to the integral of the function on the boundary. We are left with the feeling that these various results (eqns 38.1, 38.4, and 38.6) are expressing the same underlying truth, but they all look a bit different and it is hard to see how to unify them.
Can we cut through this dense thicket of different theorems for different dimensionalities? It turns out that we can, using the mathematical apparatus that we have been developing in the last few chapters. Where we will end up is the following generalized Stokes' theorem:
(38.7) C d Ω ~ = C Ω ~ (38.7) C d Ω ~ = C Ω ~ {:(38.7)int_(C)d tilde(Omega)=int_(del C) tilde(Omega):}\begin{equation*} \int_{C} d \tilde{\boldsymbol{\Omega}}=\int_{\partial C} \tilde{\boldsymbol{\Omega}} \tag{38.7} \end{equation*}(38.7)CdΩ~=CΩ~
This is an integral over a space C C CCC of the exterior derivative of an n n nnn-form Ω ~ Ω ~ tilde(Omega)\tilde{\boldsymbol{\Omega}}Ω~ is equal to the integral over the boundary of a space C C CCC, written as C C del C\partial CC, of the n n nnn-form itself. Notice that it expresses exactly the intuition we have gleaned from our previous examples that were specific to particular dimensions. This new equation works in all directions and has buried
inside it all the other vector theorems we have just discussed (and with which we suspect most readers will be familiar).
To make progress, and to reach our generalized Stokes' theorem (eqn 38.7), we will need to upgrade our notation. Consider a typical line integral, such as the one used to compute the work done by a force F F F\boldsymbol{F}F in a three-dimensional Euclidean space expressed in Cartesian coordinates, 3 3 ^(3){ }^{3}3
(38.8) I 1 = C F d r = C F x d x + F y d y + F z d z (38.8) I 1 = C F d r = C F x d x + F y d y + F z d z {:(38.8)I_(1)=int_(C)F*dr=int_(C)F_(x)dx+F_(y)dy+F_(z)dz:}\begin{equation*} I_{1}=\int_{C} \boldsymbol{F} \cdot \mathrm{~d} \boldsymbol{r}=\int_{C} F_{x} \mathrm{~d} x+F_{y} \mathrm{~d} y+F_{z} \mathrm{~d} z \tag{38.8} \end{equation*}(38.8)I1=CF dr=CFx dx+Fy dy+Fz dz
This involves the integral of something over a one-dimensional path C C CCC. From our experience in the previous chapters, we recognize that the 'something' in this case resembles a 1 -form, which can be rewritten in geometric notation as α ~ = F x d x + F y d y + F z d z α ~ = F x d x + F y d y + F z d z tilde(alpha)=F_(x)dx+F_(y)dy+F_(z)dz\tilde{\boldsymbol{\alpha}}=F_{x} \boldsymbol{d} x+F_{y} \boldsymbol{d} y+F_{z} \boldsymbol{d} zα~=Fxdx+Fydy+Fzdz. We shall write this integral as
(38.9) I 1 = C α ~ (38.9) I 1 = C α ~ {:(38.9)I_(1)=int_(C) tilde(alpha):}\begin{equation*} I_{1}=\int_{C} \tilde{\boldsymbol{\alpha}} \tag{38.9} \end{equation*}(38.9)I1=Cα~
Admittedly, it looks as if we are making a terrible faux pas in writing an integral without including a term d (something), violating a rule that will have been drummed into most of us by successive teachers of mathematics. Nevertheless, this notation is correct as we understand this to be part of the differential form, as we will discuss below.
Next, consider a surface integral. Recalling from Chapters 32 and 37 that areas can be written as bivectors, we can write a typical surface integral of a function as
(38.10) I 2 = S G d S = S ( G x d y d z + G y d z d x + G z d x d y ) (38.10) I 2 = S G d S = S G x d y d z + G y d z d x + G z d x d y {:(38.10)I_(2)=int_(S)G*dS=int_(S)(G_(x)dy^^dz+G_(y)dz^^dx+G_(z)dx^^dy):}\begin{equation*} I_{2}=\int_{S} \boldsymbol{G} \cdot \mathrm{~d} \boldsymbol{S}=\int_{S}\left(G_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+G_{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+G_{z} \boldsymbol{d} x \wedge \boldsymbol{d} y\right) \tag{38.10} \end{equation*}(38.10)I2=SG dS=S(Gxdydz+Gydzdx+Gzdxdy)
where the elements of the surface S S SSS have been recast using the wedge product. Written in this way, the integral involves the integral of a 2-form β ~ = G x d y d z + G y d z d x + G z d x d y β ~ = G x d y d z + G y d z d x + G z d x d y tilde(beta)=G_(x)dy^^dz+G_(y)dz^^dx+G_(z)dx^^dy\tilde{\boldsymbol{\beta}}=G_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+G_{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+G_{z} \boldsymbol{d} x \wedge \boldsymbol{d} yβ~=Gxdydz+Gydzdx+Gzdxdy over a surface S S SSS :
(38.11) I 2 = S β ~ . (38.11) I 2 = S β ~ . {:(38.11)I_(2)=int_(S) tilde(beta).:}\begin{equation*} I_{2}=\int_{S} \tilde{\boldsymbol{\beta}} . \tag{38.11} \end{equation*}(38.11)I2=Sβ~.
In the remainder of this chapter, we generalize these observations and see how all multiple integrals can be understood as an integral of a form. We will start by investigating how this might work and why forms are the natural content of the integrand of an integral. We then go on to investigate the surfaces over which we integrate: these are members of a family of objects called chains that are dual to forms, which is to say we have an inner product-like relationship
(38.12) form , chain " = " ( multiple integral ) = (number ) . (38.12)  form  ,  chain  " = " (  multiple   integral  ) =  (number  . {:(38.12){:(:" form "," chain ":)"="((" multiple ")/(" integral "))=" (number ").:}\begin{equation*} \left.\langle\text { form }, \text { chain }\rangle "="\binom{\text { multiple }}{\text { integral }}=\text { (number }\right) . \tag{38.12} \end{equation*}(38.12) form , chain "="( multiple  integral )= (number ).
The rest of this chapter discusses this idea, leading eventually to our generalized version of Stokes' theorem, which is one of the milestones of differential geometry.
3 3 ^(3){ }^{3}3 We shall restrict our attention to flat space and spacetime in this chapter.

38.2 Integrating over forms

A 0 -form is a function. A 1-form looks like a series of planes. A 2-form looks like a set of crossed surfaces forming a series of parallel tubes. A 3 -form looks like an array of cells. These geometric objects can be used to provide the basic integrand of (function) × × xx\times× (integration cells) over which to integrate.
A definite integral outputs a number. A multiple integral is carried out over a generalized surface: a one-dimensional line, two-dimensional surface, three-dimensional volume or even a higher dimensional surface. Geometrically, it is the combination of a differential form and a surface that gives rise to the number. Specifically, the line integral evaluates the number of times the line crosses the surfaces of the 1-form. That is, for a 1 -form Y Y Y\boldsymbol{Y}Y, we have
(38.13) line Y ~ = ( number of surfaces cut ) (38.13) line  Y ~ = (  number of   surfaces cut  ) {:(38.13)int_("line ") tilde(Y)=((" number of ")/(" surfaces cut ")):}\begin{equation*} \int_{\text {line }} \tilde{\boldsymbol{Y}}=\binom{\text { number of }}{\text { surfaces cut }} \tag{38.13} \end{equation*}(38.13)line Y~=( number of  surfaces cut )
In the same way, the surface integral counts the number of tubes of the 2 -form that cut the 2 -surface. That is, for a 2 -form X ~ X ~ tilde(X)\tilde{\boldsymbol{X}}X~, we have
(38.14) surface X ~ = ( number of tubes cut ) (38.14) surface  X ~ = (  number of   tubes cut  ) {:(38.14)int_("surface ") tilde(X)=((" number of ")/(" tubes cut ")):}\begin{equation*} \int_{\text {surface }} \tilde{\boldsymbol{X}}=\binom{\text { number of }}{\text { tubes cut }} \tag{38.14} \end{equation*}(38.14)surface X~=( number of  tubes cut )
The volume integral evaluates the number of cells of the 3 -form contained by a 3 -volume, or, for a 3 -form W ~ W ~ tilde(W)\tilde{\boldsymbol{W}}W~, we have
(38.15) volume W ~ = ( number of cells contained ) (38.15) volume  W ~ = (  number of   cells contained  ) {:(38.15)int_("volume ") tilde(W)=((" number of ")/(" cells contained ")):}\begin{equation*} \int_{\text {volume }} \tilde{\boldsymbol{W}}=\binom{\text { number of }}{\text { cells contained }} \tag{38.15} \end{equation*}(38.15)volume W~=( number of  cells contained )
If we accept that integrals are indeed all carried out over forms, then we have a neat explanation for a potentially puzzling aspect of multiple integrals: the existence of the Jacobian. 4 4 ^(4){ }^{4}4 Recall how multiple integration is often introduced. We split a surface up into an array of infinitesimal cells and evaluate a function over the surface by summing its value on each of the cells. If we use a Cartesian coordinate system, a typical volume integral then looks like
(38.16) V f ( x , y , z ) d x d y d z (38.16) V f ( x , y , z ) d x d y d z {:(38.16)int_(V)f(x","y","z)dxdydz:}\begin{equation*} \int_{V} f(x, y, z) \mathrm{d} x \mathrm{~d} y \mathrm{~d} z \tag{38.16} \end{equation*}(38.16)Vf(x,y,z)dx dy dz
However, when we generalize to different coordinate systems, we need to include a factor of the Jacobian determinant J J JJJ, which guarantees that the cells defined by the new coordinates correctly mesh, allowing the volume to be covered without any gaps. Given two coordinate systems, with coordinates ( x 0 , , x n ) x 0 , , x n (x^(0),dots,x^(n))\left(x^{0}, \ldots, x^{n}\right)(x0,,xn) and ( y 0 , y n ) y 0 , y n (y^(0),dotsy^(n))\left(y^{0}, \ldots y^{n}\right)(y0,yn), the Jacobian determinant is defined as
(38.17) J = ( x 0 , , x n ) ( y 0 , , y n ) = | x 0 y 0 x 0 y n x n y 0 x n y n | (38.17) J = x 0 , , x n y 0 , , y n = x 0 y 0 x 0 y n x n y 0 x n y n {:(38.17)J=(del(x^(0),dots,x^(n)))/(del(y^(0),dots,y^(n)))=|[(delx^(0))/(dely^(0)),dots,(delx^(0))/(dely^(n))],[vdots,ddots,vdots],[(delx^(n))/(dely^(0)),cdots,(delx^(n))/(dely^(n))]|:}J=\frac{\partial\left(x^{0}, \ldots, x^{n}\right)}{\partial\left(y^{0}, \ldots, y^{n}\right)}=\left|\begin{array}{ccc} \frac{\partial x^{0}}{\partial y^{0}} & \ldots & \frac{\partial x^{0}}{\partial y^{n}} \tag{38.17}\\ \vdots & \ddots & \vdots \\ \frac{\partial x^{n}}{\partial y^{0}} & \cdots & \frac{\partial x^{n}}{\partial y^{n}} \end{array}\right|(38.17)J=(x0,,xn)(y0,,yn)=|x0y0x0ynxny0xnyn|
The determinant is an antisymmetric object and can be formed using 5 5 ^(5){ }^{5}5 the antisymmetric symbol ε μ α β γ ε μ α β γ epsi_(mu alpha beta gamma)\varepsilon_{\mu \alpha \beta \gamma}εμαβγ.
So how does the Jacobian factor arise? Let's investigate what happens to an element of surface, written as a 2 -form d α 1 × d α 2 d α 1 × d α 2 dalpha^(1)xx dalpha^(2)\boldsymbol{d} \alpha^{1} \times \boldsymbol{d} \alpha^{2}dα1×dα2, when a change of coordinates is made.
Example 38.1
Suppose we have a mesh given in terms of coordinates α 1 α 1 alpha^(1)\alpha^{1}α1 and α 2 α 2 alpha^(2)\alpha^{2}α2 and we want to write it in terms of coordinates x 1 x 1 x^(1)x^{1}x1 and x 2 x 2 x^(2)x^{2}x2. We write the basis 1 -forms in terms of the new coordinates as follows:
d α 1 = α 1 x 1 d x 1 + α 1 x 2 d x 2 (38.19) d α 2 = α 2 x 1 d x 1 + α 2 x 2 d x 2 d α 1 = α 1 x 1 d x 1 + α 1 x 2 d x 2 (38.19) d α 2 = α 2 x 1 d x 1 + α 2 x 2 d x 2 {:[dalpha^(1)=(delalpha^(1))/(delx^(1))dx^(1)+(delalpha^(1))/(delx^(2))dx^(2)],[(38.19)dalpha^(2)=(delalpha^(2))/(delx^(1))dx^(1)+(delalpha^(2))/(delx^(2))dx^(2)]:}\begin{align*} & \boldsymbol{d} \alpha^{1}=\frac{\partial \alpha^{1}}{\partial x^{1}} \boldsymbol{d} x^{1}+\frac{\partial \alpha^{1}}{\partial x^{2}} \boldsymbol{d} x^{2} \\ & \boldsymbol{d} \alpha^{2}=\frac{\partial \alpha^{2}}{\partial x^{1}} \boldsymbol{d} x^{1}+\frac{\partial \alpha^{2}}{\partial x^{2}} \boldsymbol{d} x^{2} \tag{38.19} \end{align*}dα1=α1x1dx1+α1x2dx2(38.19)dα2=α2x1dx1+α2x2dx2
In terms of the new coordinates, the areal element is
d α 1 d α 2 = ( α 1 x 1 d x 1 + α 1 x 2 d x 2 ) ( α 2 x 1 d x 1 + α 2 x 2 d x 2 ) = α 1 x 1 α 2 x 2 ( d x 1 d x 2 ) + α 1 x 2 α 2 x 1 ( d x 2 d x 1 ) = ( α 1 x 1 α 2 x 2 α 1 x 2 α 2 x 1 ) ( d x 1 d x 2 ) = ε μ ν α μ x 1 α ν x 2 d x 1 d x 2 (38.20) = ( α 1 , α 2 ) ( x 1 , x 2 ) d x 1 d x 2 . d α 1 d α 2 = α 1 x 1 d x 1 + α 1 x 2 d x 2 α 2 x 1 d x 1 + α 2 x 2 d x 2 = α 1 x 1 α 2 x 2 d x 1 d x 2 + α 1 x 2 α 2 x 1 d x 2 d x 1 = α 1 x 1 α 2 x 2 α 1 x 2 α 2 x 1 d x 1 d x 2 = ε μ ν α μ x 1 α ν x 2 d x 1 d x 2 (38.20) = α 1 , α 2 x 1 , x 2 d x 1 d x 2 . {:[dalpha^(1)^^dalpha^(2)=((delalpha^(1))/(delx^(1))*dx^(1)+(delalpha^(1))/(delx^(2))*dx^(2))^^((delalpha^(2))/(delx^(1))*dx^(1)+(delalpha^(2))/(delx^(2))*dx^(2))],[=(delalpha^(1))/(delx^(1))*(delalpha^(2))/(delx^(2))(dx^(1)^^dx^(2))+(delalpha^(1))/(delx^(2))*(delalpha^(2))/(delx^(1))(dx^(2)^^dx^(1))],[=((delalpha^(1))/(delx^(1))*(delalpha^(2))/(delx^(2))-(delalpha^(1))/(delx^(2))(delalpha^(2))/(delx^(1)))(dx^(1)^^dx^(2))],[=epsi_(mu nu)(delalpha^(mu))/(delx^(1))(delalpha^(nu))/(delx^(2))dx^(1)^^dx^(2)],[(38.20)=(del(alpha^(1),alpha^(2)))/(del(x^(1),x^(2)))*dx^(1)^^dx^(2).]:}\begin{align*} \boldsymbol{d} \alpha^{1} \wedge \boldsymbol{d} \alpha^{2} & =\left(\frac{\partial \alpha^{1}}{\partial x^{1}} \cdot \boldsymbol{d} x^{1}+\frac{\partial \alpha^{1}}{\partial x^{2}} \cdot \boldsymbol{d} x^{2}\right) \wedge\left(\frac{\partial \alpha^{2}}{\partial x^{1}} \cdot \boldsymbol{d} x^{1}+\frac{\partial \alpha^{2}}{\partial x^{2}} \cdot \boldsymbol{d} x^{2}\right) \\ & =\frac{\partial \alpha^{1}}{\partial x^{1}} \cdot \frac{\partial \alpha^{2}}{\partial x^{2}}\left(\boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2}\right)+\frac{\partial \alpha^{1}}{\partial x^{2}} \cdot \frac{\partial \alpha^{2}}{\partial x^{1}}\left(\boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{1}\right) \\ & =\left(\frac{\partial \alpha^{1}}{\partial x^{1}} \cdot \frac{\partial \alpha^{2}}{\partial x^{2}}-\frac{\partial \alpha^{1}}{\partial x^{2}} \frac{\partial \alpha^{2}}{\partial x^{1}}\right)\left(\boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2}\right) \\ & =\varepsilon_{\mu \nu} \frac{\partial \alpha^{\mu}}{\partial x^{1}} \frac{\partial \alpha^{\nu}}{\partial x^{2}} \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \\ & =\frac{\partial\left(\alpha^{1}, \alpha^{2}\right)}{\partial\left(x^{1}, x^{2}\right)} \cdot \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} . \tag{38.20} \end{align*}dα1dα2=(α1x1dx1+α1x2dx2)(α2x1dx1+α2x2dx2)=α1x1α2x2(dx1dx2)+α1x2α2x1(dx2dx1)=(α1x1α2x2α1x2α2x1)(dx1dx2)=εμναμx1ανx2dx1dx2(38.20)=(α1,α2)(x1,x2)dx1dx2.
Where we recognize the Jacobian determinant J J JJJ in the final line.
We conclude that the Jacobian determinant is generated naturally by the algebra of forms.

38.3 Anatomy of an integral

We saw in the last section how a surface and a form are combined to give a number. This is reminiscent of how a 1 -form α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~ and its dual, a vector v v v\boldsymbol{v}v, are combined to make a number. This, of course, is done using an inner product α ~ , v α ~ , v (: tilde(alpha),v:)\langle\tilde{\boldsymbol{\alpha}}, \boldsymbol{v}\rangleα~,v. By examining the anatomy of a multiple integral, we shall see how a form can be combined in an inner product with its dual in this context: the generalized surface. This inner product gives us our integral.
A general multiple integral can be written as 6 6 ^(6){ }^{6}6
(38.21) I = S ( λ 1 , , λ n ) f ( x 1 , , x n ) ( x 1 , , x n ) ( λ 1 , , λ n ) d λ 1 , , d λ n (38.21) I = S λ 1 , , λ n f x 1 , , x n x 1 , , x n λ 1 , , λ n d λ 1 , , d λ n {:(38.21)I=int_(S(lambda^(1),dots,lambda^(n)))f(x^(1),dots,x^(n))(del(x^(1),dots,x^(n)))/(del(lambda^(1),dots,lambda^(n)))dlambda^(1)","dots","dlambda^(n):}\begin{equation*} \mathcal{I}=\int_{\mathcal{S}\left(\lambda^{1}, \ldots, \lambda^{n}\right)} f\left(x^{1}, \ldots, x^{n}\right) \frac{\partial\left(x^{1}, \ldots, x^{n}\right)}{\partial\left(\lambda^{1}, \ldots, \lambda^{n}\right)} \mathrm{d} \lambda^{1}, \ldots, \mathrm{~d} \lambda^{n} \tag{38.21} \end{equation*}(38.21)I=S(λ1,,λn)f(x1,,xn)(x1,,xn)(λ1,,λn)dλ1,, dλn
6 6 ^(6){ }^{6}6 We label coordinates here starting from 1 (rather than zero, as we have been doing for (3+1)-dimensional spacetime).
5 5 ^(5){ }^{5}5 Specifically, we can also write J J JJJ as a
sum
J = ε i 0 i 1 i n x i 0 y 0 x i 1 y 1 x i n y n J = ε i 0 i 1 i n x i 0 y 0 x i 1 y 1 x i n y n J=epsi_(i_(0)i_(1)dotsi_(n))(delx^(i_(0)))/(dely^(0))(delx^(i_(1)))/(dely^(1))cdots(delx^(i_(n)))/(dely^(n))J=\varepsilon_{i_{0} i_{1} \ldots i_{n}} \frac{\partial x^{i_{0}}}{\partial y^{0}} \frac{\partial x^{i_{1}}}{\partial y^{1}} \cdots \frac{\partial x^{i_{n}}}{\partial y^{n}}J=εi0i1inxi0y0xi1y1xinyn. (38.18)
This says that there's a surface S S S\mathcal{S}S parametrized by some variables λ i λ i lambda^(i)\lambda^{i}λi. We integrate a function f ( x 1 , , x n ) f x 1 , , x n f(x^(1),dots,x^(n))f\left(x^{1}, \ldots, x^{n}\right)f(x1,,xn) by summing the function multiplied by a surface element d x 1 d x n d x 1 d x n dx^(1)dotsdx^(n)\mathrm{d} x^{1} \ldots \mathrm{~d} x^{n}dx1 dxn written in the function's coordinates. The
Jacobian guarantees that the surface elements (given in terms of x i x i x^(i)x^{i}xi ) mesh to cover the surface (given in terms of λ i λ i lambda^(i)\lambda^{i}λi ). The key points of this section are that:
  • The function and surface element together can be represented as an n n nnn-form α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~;
  • The surface can be represented as a n n nnn-vector S S S\boldsymbol{S}S;
  • The integrand, complete with Jacobian, can be recreated by taking the inner product of the n n nnn-form α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~ and n n nnn-vector S S S\boldsymbol{S}S;
  • Geometrically, the integral is an inner product that counts the number of n n nnn-cells representing α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~, contained in the n n nnn-parallelepiped that represents S S SSS.
    Let's break this up into parts. We first look at the function and surface element that is contained in the form α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~, which looks like
(38.22) α ~ = f ( x 1 , , x n ) d x 1 d x n (38.22) α ~ = f x 1 , , x n d x 1 d x n {:(38.22) tilde(alpha)=f(x^(1),dots,x^(n))dx^(1)^^dots^^dx^(n):}\begin{equation*} \tilde{\boldsymbol{\alpha}}=f\left(x^{1}, \ldots, x^{n}\right) \boldsymbol{d} x^{1} \wedge \ldots \wedge \boldsymbol{d} x^{n} \tag{38.22} \end{equation*}(38.22)α~=f(x1,,xn)dx1dxn
This clearly falls apart into (i) a function and (ii) a surface element built from forms. The form in the surface element has the necessary properties to naturally give the Jacobian matrix required for our choice of coordinates. In the next example, we remind ourselves how to represent a 4 -volume and 3 -volume in terms of forms.

Example 38.2

Working in (3+1)-dimensional flat space with coordinates ( t , x , y , z ) ( t , x , y , z ) (t,x,y,z)(t, x, y, z)(t,x,y,z), we can interpret the 4 -volume element as a volume 4 -form 7 7 ^(7){ }^{7}7
ω ~ ( , , , ) = 1 4 ! ε μ ν α β d x μ d x ν d x α d x β = ε | μ ν α β | d x μ d x ν d x α d x β = ε 0123 d t d x d y d z (38.23) = d t d x d y d z ω ~ ( , , , ) = 1 4 ! ε μ ν α β d x μ d x ν d x α d x β = ε | μ ν α β | d x μ d x ν d x α d x β = ε 0123 d t d x d y d z (38.23) = d t d x d y d z {:[ tilde(omega)(","","",")=(1)/(4!)*epsi_(mu nu alpha beta)dx^(mu)^^dx^(nu)^^dx^(alpha)^^dx^(beta)],[=epsi_(|mu nu alpha beta|)dx^(mu)^^dx^(nu)^^dx^(alpha)^^dx^(beta)],[=epsi_(0123)dt^^dx^^dy^^dz],[(38.23)=dt^^dx^^dy^^dz]:}\begin{align*} \tilde{\boldsymbol{\omega}}(,,,) & =\frac{1}{4!} \cdot \varepsilon_{\mu \nu \alpha \beta} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu} \wedge \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \\ & =\varepsilon_{|\mu \nu \alpha \beta|} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu} \wedge \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \\ & =\varepsilon_{0123} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & =\boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \tag{38.23} \end{align*}ω~(,,,)=14!εμναβdxμdxνdxαdxβ=ε|μναβ|dxμdxνdxαdxβ=ε0123dtdxdydz(38.23)=dtdxdydz
8 8 ^(8){ }^{8}8 Recall that this notation forces α < α < alpha < quad\alpha<\quadα< where we've used the | α β γ δ | | α β γ δ | |alpha beta gamma delta||\alpha \beta \gamma \delta||αβγδ| notation. 8 8 ^(8){ }^{8}8 (One message here is that we can leave the β < γ < δ β < γ < δ beta < gamma < delta\beta<\gamma<\deltaβ<γ<δ.
9 9 ^(9){ }^{9}9 The 3-form d Σ ~ μ d Σ ~ μ d tilde(Sigma)_(mu)\mathrm{d} \tilde{\boldsymbol{\Sigma}}_{\mu}dΣ~μ is dual to a basis vector e μ e μ e_(mu)\boldsymbol{e}_{\mu}eμ, i.e. d Σ ~ μ = e μ d Σ ~ μ = e μ d tilde(Sigma)_(mu)=***e_(mu)\mathrm{d} \tilde{\boldsymbol{\Sigma}}_{\mu}=\star \boldsymbol{e}_{\mu}dΣ~μ=eμ. If we fill the three slots of the tensor d Σ ~ μ d Σ ~ μ d tilde(Sigma)_(mu)\mathrm{d} \tilde{\boldsymbol{\Sigma}}_{\mu}dΣ~μ, we obtain three slots of the tensor d Σ μ d Σ μ dSigma_(mu)\mathrm{d} \Sigma_{\mu}dΣμ, we obtain
the components of the 3 -volume 1-form d σ ~ ( e μ ) d σ ~ e μ d tilde(sigma)(e_(mu))\mathrm{d} \tilde{\boldsymbol{\sigma}}\left(\boldsymbol{e}_{\mu}\right)dσ~(eμ) from the last chapter.
where we ve used the α β α β alpha beta\alpha \betaαβ 倍 sum unrestricted and include the factor 4 !, or restrict the sum.)
In the same way, the 3 -volume element can be provided by a 3 -form 9 9 ^(9){ }^{9}9
d Σ ~ μ ( , , ) = 1 3 ! ε μ α β γ d x α d x β d x γ (38.24) = ε μ | α β γ | d x α d x β d x γ d Σ ~ μ ( , , ) = 1 3 ! ε μ α β γ d x α d x β d x γ (38.24) = ε μ | α β γ | d x α d x β d x γ {:[d tilde(Sigma)_(mu)(","",")=(1)/(3!)epsi_(mu alpha beta gamma)dx^(alpha)^^dx^(beta)^^dx^(gamma)],[(38.24)=epsi_(mu|alpha beta gamma|)dx^(alpha)^^dx^(beta)^^dx^(gamma)]:}\begin{align*} \mathrm{d} \tilde{\boldsymbol{\Sigma}}_{\mu}(,,) & =\frac{1}{3!} \varepsilon_{\mu \alpha \beta \gamma} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \\ & =\varepsilon_{\mu|\alpha \beta \gamma|} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \tag{38.24} \end{align*}dΣ~μ(,,)=13!εμαβγdxαdxβdxγ(38.24)=εμ|αβγ|dxαdxβdxγ
In many applications, we shall integrate 3-form objects written as J μ d Σ ~ μ J μ d Σ ~ μ J^(mu)d tilde(Sigma)_(mu)J^{\mu} \mathrm{d} \tilde{\boldsymbol{\Sigma}}_{\mu}JμdΣ~μ over some surface. In terms of forms, this is to be interpreted as
J μ d Σ ~ μ = 1 3 ! J μ ε μ α β γ d x α d x β d x γ (38.25) = J μ ε μ | α β γ | d x α d x β d x γ J μ d Σ ~ μ = 1 3 ! J μ ε μ α β γ d x α d x β d x γ (38.25) = J μ ε μ | α β γ | d x α d x β d x γ {:[J^(mu)d tilde(Sigma)_(mu)=(1)/(3!)J^(mu)epsi_(mu alpha beta gamma)dx^(alpha)^^dx^(beta)^^dx^(gamma)],[(38.25)=J^(mu)epsi_(mu|alpha beta gamma|)dx^(alpha)^^dx^(beta)^^dx^(gamma)]:}\begin{align*} J^{\mu} d \tilde{\boldsymbol{\Sigma}}_{\mu} & =\frac{1}{3!} J^{\mu} \varepsilon_{\mu \alpha \beta \gamma} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \\ & =J^{\mu} \varepsilon_{\mu|\alpha \beta \gamma|} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \tag{38.25} \end{align*}JμdΣ~μ=13!Jμεμαβγdxαdxβdxγ(38.25)=Jμεμ|αβγ|dxαdxβdxγ
With the integrand identified as a form, we now attempt to describe the generalized surface as an n n nnn-vector. This is possible since, just as n n nnn-forms are built from wedge products of 1-forms, the generalized surface over
which we integrate can be built from wedge products of vectors. Let's consider doing a general multiple integral
(38.26) I = S α ~ (38.26) I = S α ~ {:(38.26)I=int_(S) tilde(alpha):}\begin{equation*} I=\int_{\mathcal{S}} \tilde{\boldsymbol{\alpha}} \tag{38.26} \end{equation*}(38.26)I=Sα~
where the form α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~ is integrated over the n n nnn-dimensional surface S S S\mathcal{S}S. To see how to construct an element of generalized n n nnn-dimensional surface S S S\mathcal{S}S, consider the vicinity of a points P P P\mathcal{P}P on a surface P ( λ 1 , λ 2 , , λ n ) P λ 1 , λ 2 , , λ n P(lambda^(1),lambda^(2),dots,lambda^(n))\mathcal{P}\left(\lambda^{1}, \lambda^{2}, \ldots, \lambda^{n}\right)P(λ1,λ2,,λn), where λ i λ i lambda^(i)\lambda^{i}λi are the coordinates parametrizing the surface. A small displacement in the λ i λ i lambda^(i)\lambda^{i}λi direction is written as ( P / λ i ) Δ λ i P / λ i Δ λ i (delP//dellambda^(i))Deltalambda^(i)\left(\partial \mathcal{P} / \partial \lambda^{i}\right) \Delta \lambda^{i}(P/λi)Δλi, which is formed from a vector P / λ i P / λ i delP//dellambda^(i)\partial \mathcal{P} / \partial \lambda^{i}P/λi and infinitesimal length element Δ λ i Δ λ i Deltalambda^(i)\Delta \lambda^{i}Δλi. In building a surface, we can take the wedge products of the vector parts to form a generalized n n nnn-dimensional parallelepiped. Therefore, tangent to the surface S S S\mathcal{S}S at point P P P\mathcal{P}P is an infinitesimal parallelepiped built from vectors
P λ 1 Δ λ 1 P λ 2 Δ λ 2 P λ n Δ λ n (38.27) = ( P λ 1 P λ 2 P λ n ) Δ λ 1 Δ λ 2 Δ λ n P λ 1 Δ λ 1 P λ 2 Δ λ 2 P λ n Δ λ n (38.27) = P λ 1 P λ 2 P λ n Δ λ 1 Δ λ 2 Δ λ n {:[(delP)/(dellambda^(1))Deltalambda^(1)^^(delP)/(dellambda^(2))*Deltalambda^(2)^^dots^^(delP)/(dellambda^(n))*Deltalambda^(n)],[(38.27)=((delP)/(dellambda^(1))^^(delP)/(dellambda^(2))^^dots^^(delP)/(dellambda^(n)))Deltalambda^(1)Deltalambda^(2)dots Deltalambda^(n)]:}\begin{align*} \frac{\partial \mathcal{P}}{\partial \lambda^{1}} \Delta \lambda^{1} \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{2}} \cdot \Delta \lambda^{2} \wedge \ldots \wedge & \frac{\partial \mathcal{P}}{\partial \lambda^{n}} \cdot \Delta \lambda^{n} \\ & =\left(\frac{\partial \mathcal{P}}{\partial \lambda^{1}} \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{2}} \wedge \ldots \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{n}}\right) \Delta \lambda^{1} \Delta \lambda^{2} \ldots \Delta \lambda^{n} \tag{38.27} \end{align*}Pλ1Δλ1Pλ2Δλ2PλnΔλn(38.27)=(Pλ1Pλ2Pλn)Δλ1Δλ2Δλn
To perform the integral, we evaluate how many infinitesimal cells represented by the integration form α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~ are cut by the surface. Recall that the machine that works this out is the inner product, 10 10 ^(10){ }^{10}10 since this tells us how a n n nnn-form and n n nnn-vector mesh together. 11 11 ^(11){ }^{11}11 We therefore interpret the integral symbol as follows:
(38.30) S α ~ = α ~ , P λ 1 P λ 2 P λ n d λ 1 d λ 2 d λ n (38.30) S α ~ = α ~ , P λ 1 P λ 2 P λ n d λ 1 d λ 2 d λ n {:(38.30)int_(S) tilde(alpha)=∬dots int(:( tilde(alpha)),(delP)/(dellambda^(1))^^(delP)/(dellambda^(2))^^dots^^(delP)/(dellambda^(n)):)dlambda^(1)dlambda^(2)dotsdlambda^(n):}\begin{equation*} \int_{\mathcal{S}} \tilde{\boldsymbol{\alpha}}=\iint \ldots \int\left\langle\tilde{\boldsymbol{\alpha}}, \frac{\partial \mathcal{P}}{\partial \lambda^{1}} \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{2}} \wedge \ldots \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{n}}\right\rangle \mathrm{d} \lambda^{1} \mathrm{~d} \lambda^{2} \ldots \mathrm{~d} \lambda^{n} \tag{38.30} \end{equation*}(38.30)Sα~=α~,Pλ1Pλ2Pλndλ1 dλ2 dλn
Let's see how this generates the Jacobian. Consider a basis forms d x 1 , , d x n d x 1 , , d x n dx^(1),dots,dx^(n)\boldsymbol{d} x^{1}, \ldots, \boldsymbol{d} x^{n}dx1,,dxn such that the form is written as
(38.31) α = α ( x 1 , , x n ) d x 1 d x 2 d x n (38.31) α = α x 1 , , x n d x 1 d x 2 d x n {:(38.31)alpha=alpha(x^(1),dots,x^(n))dx^(1)^^dx^(2)^^dots^^dx^(n):}\begin{equation*} \boldsymbol{\alpha}=\alpha\left(x^{1}, \ldots, x^{n}\right) \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \wedge \ldots \wedge \boldsymbol{d} x^{n} \tag{38.31} \end{equation*}(38.31)α=α(x1,,xn)dx1dx2dxn
then we have, in the inner product,
(38.32) d x 1 d x 2 d x n , P λ 1 P λ 2 P λ n = ( x 1 , x 2 , , x n ) ( λ 1 , λ 2 , , λ n ) (38.32) d x 1 d x 2 d x n , P λ 1 P λ 2 P λ n = x 1 , x 2 , , x n λ 1 , λ 2 , , λ n {:(38.32)(:dx^(1)^^dx^(2)^^dots^^dx^(n),(delP)/(dellambda^(1))^^(delP)/(dellambda^(2))^^dots^^(delP)/(dellambda^(n)):)=(del(x^(1),x^(2),dots,x^(n)))/(del(lambda^(1),lambda^(2),dots,lambda^(n))):}\begin{equation*} \left\langle\boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \wedge \ldots \wedge \boldsymbol{d} x^{n}, \frac{\partial \mathcal{P}}{\partial \lambda^{1}} \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{2}} \wedge \ldots \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{n}}\right\rangle=\frac{\partial\left(x^{1}, x^{2}, \ldots, x^{n}\right)}{\partial\left(\lambda^{1}, \lambda^{2}, \ldots, \lambda^{n}\right)} \tag{38.32} \end{equation*}(38.32)dx1dx2dxn,Pλ1Pλ2Pλn=(x1,x2,,xn)(λ1,λ2,,λn)
We check this gives the expected answer for the simplest possible case in the next example.

Example 38.3

Consider a line integral curve d f curve  d f int_("curve ")df\int_{\text {curve }} \boldsymbol{d} fcurve df. This can be done by inspection: the answer is simply the difference in the values of the function f f fff at the start and end of the curve. 12 12 ^(12){ }^{12}12 We parametrize the curve with 0 λ 1 0 λ 1 0 <= lambda <= 10 \leq \lambda \leq 10λ1 and we have
curve d f = 0 1 d f , P λ d λ (38.33) = 0 1 d f d λ d λ = f [ P ( 1 ) ] f [ P ( 0 ) ] curve  d f = 0 1 d f , P ¯ λ d λ (38.33) = 0 1 d f d λ d λ = f [ P ( 1 ) ] f [ P ( 0 ) ] {:[int_("curve ")df=int_(0)^(1)(:df,( bar(P))/(del lambda):)dlambda],[(38.33)=int_(0)^(1)((d)f)/((d)lambda)dlambda=f[P(1)]-f[P(0)]]:}\begin{align*} \int_{\text {curve }} \boldsymbol{d} f & =\int_{0}^{1}\left\langle\boldsymbol{d} f, \frac{\overline{\mathcal{P}}}{\partial \lambda}\right\rangle \mathrm{d} \lambda \\ & =\int_{0}^{1} \frac{\mathrm{~d} f}{\mathrm{~d} \lambda} \mathrm{~d} \lambda=f[\mathcal{P}(1)]-f[\mathcal{P}(0)] \tag{38.33} \end{align*}curve df=01df,Pλdλ(38.33)=01 df dλ dλ=f[P(1)]f[P(0)]
10 10 ^(10){ }^{10}10 Consider a 2 -form F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ and a bivector u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv. The contraction
(38.28) F ~ , u v (38.28) F ~ , u v {:(38.28)(: tilde(F)","u^^v:):}\begin{equation*} \langle\tilde{\boldsymbol{F}}, \boldsymbol{u} \wedge \boldsymbol{v}\rangle \tag{38.28} \end{equation*}(38.28)F~,uv
tells us to evaluate how much many tubes of F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ are cut by the bivector u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv. Looking ahead, we put the material in this section to work in Chapter 43 by identifying an infinitesimal surface Δ x e x Δ y e y Δ x e x Δ y e y Delta xe_(x)^^Delta ye_(y)\Delta x e_{x} \wedge \Delta y e_{y}ΔxexΔyey and evaluating the integral
Δ x e x Δ y e y F ~ = F ~ , e x e y Δ x Δ y = F ~ ( e x , e y ) Δ x Δ y = F x y Δ x Δ y . Δ x e x Δ y e y F ~ = F ~ , e x e y Δ x Δ y = F ~ e x , e y Δ x Δ y = F x y Δ x Δ y . {:[int_(Delta xe_(x)^^Delta ye_(y)) tilde(F)],[=int(:( tilde(F)),e_(x)^^e_(y):)Delta x Delta y],[=int tilde(F)(e_(x),e_(y))Delta x Delta y],[=intF_(xy)Delta x Delta y.]:}\begin{aligned} & \int_{\Delta x \boldsymbol{e}_{x} \wedge \Delta y \boldsymbol{e}_{y}} \tilde{\boldsymbol{F}} \\ &=\int\left\langle\tilde{\boldsymbol{F}}, \boldsymbol{e}_{x} \wedge \boldsymbol{e}_{y}\right\rangle \Delta x \Delta y \\ &=\int \tilde{\boldsymbol{F}}\left(\boldsymbol{e}_{x}, \boldsymbol{e}_{y}\right) \Delta x \Delta y \\ &=\int F_{x y} \Delta x \Delta y . \end{aligned}ΔxexΔyeyF~=F~,exeyΔxΔy=F~(ex,ey)ΔxΔy=FxyΔxΔy.
See Chapter 43 for the details.
11 11 ^(11){ }^{11}11 Remember the rule that α ~ , v = α ~ , v = (: tilde(alpha),v:)=\langle\tilde{\boldsymbol{\alpha}}, \boldsymbol{v}\rangle=α~,v= α | i 1 i n | v i 1 i n α i 1 i n v i 1 i n alpha_(|i_(1)dotsi_(n)|)v^(i_(1)dotsi_(n))\alpha_{\left|i_{1} \ldots i_{n}\right|} v^{i_{1} \ldots i_{n}}α|i1in|vi1in.
12 12 ^(12){ }^{12}12 This integral is independent of the path taken by the curve, showing that the form d f d f df\boldsymbol{d} fdf corresponds to a conservative force field. We can generalize that the integral of an exact 1 -form d f d f dfd fdf is path independent, and given by the change in the function f f fff along the curve.
Fig. 38.1 Boundaries for several of ob jects. In some cases, for spaces with no boundary, we generate the empty set (shown schematically as the space vanishing 'in a puff of logic', in Douglas Adams' memorable phrase). This figure illustrates some of the examples given in Example 38.4
13 13 ^(13){ }^{13}13 The concept of chains comes from algebraic topology, where the chain group is an algebraic structure composed by constructing linear combinations of 'simplicial complexes', the idea being that a topological space can be constructed by gluing together little triangles or tetrahedra, or other highdimensional basic objects (the simpli cial complexes). For more details, see e.g. Nakahara (1990), but this algebraic treatment is not important for our purposes here and we can just think of chains simply as topological spaces.
We recover the expected result.
We conclude that an integral is an inner product between a form and a surface. Since it outputs a number, the surfaces must be dual to the forms. In the next section, we examine the properties of the surfaces in more detail.

38.4 Boundaries and chains

Here we examine the properties of the generalized surfaces we integrate over. First a definition: the boundary M M del M\partial MM of a region of M M MMM, consists of those points of M M MMM that do not lie in the interior.
Example 38.4
The boundary of the closed unit disc is the unit circle.
The boundary of a finite cylinder is two circles.
The boundary of a line is its two end points.
The boundary of the unit sphere is empty (i.e. there isn't one).
We regard del\partial as an operator that extracts the boundary from a space M M MMM. Some examples of the use of this operator are shown in Fig. 38.1.
With boundaries at our disposal we turn to the main topic of this section. The generalized surfaces over which we integrate form a family of objects we call chains. 13 13 ^(13){ }^{13}13 As described above, forms are dual to chains, in that we combine them to output a number. We define and denote our chains as follows:
(38.34) C 0 0 -chain = point C 1 1-chain = line C 2 2-chain = area (or 2 -volume) C 3 3 -chain = 3 -volume C n n -chain = n -volume (38.34) C 0 0 -chain  =  point  C 1  1-chain  =  line  C 2  2-chain  =  area (or  2 -volume)  C 3 3 -chain  = 3 -volume  C n n -chain  = n -volume  {:(38.34){:[C_(0),0"-chain "=" point "],[C_(1)," 1-chain "=" line "],[C_(2)," 2-chain "=" area (or "2"-volume) "],[C_(3),3"-chain "=3"-volume "],[C_(n)quad n"-chain "=n"-volume "]:}:}\begin{array}{ll} C_{0} & 0 \text {-chain }=\text { point } \\ C_{1} & \text { 1-chain }=\text { line } \\ C_{2} & \text { 2-chain }=\text { area (or } 2 \text {-volume) } \tag{38.34}\\ C_{3} & 3 \text {-chain }=3 \text {-volume } \\ C_{n} \quad n \text {-chain }=n \text {-volume } \end{array}(38.34)C00-chain = point C1 1-chain = line C2 2-chain = area (or 2-volume) C33-chain =3-volume Cnn-chain =n-volume 
The boundary operator del\partial acts on the chains, mapping a chain C n C n C_(n)C_{n}Cn on to a chain C n 1 C n 1 C_(n-1)C_{n-1}Cn1. That is, it inputs an n n nnn-chain and outputs an ( n 1 ) ( n 1 ) (n-1)(n-1)(n1)-chain. We have, therefore, that
(38.35) C n = C n 1 (38.35) C n = C n 1 {:(38.35)delC_(n)=C_(n-1):}\begin{equation*} \partial C_{n}=C_{n-1} \tag{38.35} \end{equation*}(38.35)Cn=Cn1
Some chains have no boundaries. A closed line like a circle has no boundary. We call these boundary-free chains closed chains or cycles, often denoted Z n Z n Z_(n)Z_{n}Zn with the property Z n = 0 Z n = 0 delZ_(n)=0\partial Z_{n}=0Zn=0. Some chains are boundaries to higher dimensional chains. A closed surface is boundary to a volume; a closed line is boundary to a surface; the boundary of a line is two points. In short, the boundary of an n + 1 n + 1 n+1n+1n+1-chain is an n n nnn-chain. If B n B n B_(n)B_{n}Bn is the boundary of an ( n + 1 ) ( n + 1 ) (n+1)(n+1)(n+1)-chain we have
(38.36) B n = C n + 1 (38.36) B n = C n + 1 {:(38.36)B_(n)=delC_(n+1):}\begin{equation*} B_{n}=\partial C_{n+1} \tag{38.36} \end{equation*}(38.36)Bn=Cn+1
However, boundaries themselves are closed: they are boundaries, but don't themselves have boundaries. So any boundary has
(38.37) B n = 0 (38.37) B n = 0 {:(38.37)delB_(n)=0:}\begin{equation*} \partial B_{n}=0 \tag{38.37} \end{equation*}(38.37)Bn=0
Combining the two previous equations we see that C n + 1 = 0 C n + 1 = 0 del delC_(n+1)=0\partial \partial C_{n+1}=0Cn+1=0. Since there was nothing special about C n + 1 C n + 1 C_(n+1)C_{n+1}Cn+1 we conclude that
(38.38) = 2 = 0 (38.38) = 2 = 0 {:(38.38)del del=del^(2)=0:}\begin{equation*} \partial \partial=\partial^{2}=0 \tag{38.38} \end{equation*}(38.38)=2=0
or, in words, the boundary of a boundary is zero. 14 14 ^(14){ }^{14}14

38.5 Stokes' theorem

An integral involves the inner product of an n n nnn-chain and an n n nnn-form. The pinnacle of this description of calculus in terms of chains and forms is Stokes' theorem. 15 15 ^(15){ }^{15}15 In order to understand the content of the theorem, we need to bring in one final (but familiar) piece of technology: the exterior derivative d d d\boldsymbol{d}d. The key to Stokes' theorem is to note that just as the boundary operator del\partial takes a ( n + 1 ) ( n + 1 ) (n+1)(n+1)(n+1)-chain and outputs an n n nnn chain, the exterior derivative operator d d d\boldsymbol{d}d takes a n n nnn form and outputs a ( n + 1 n + 1 n+1n+1n+1 )-form.
Example 38.5
Recall that in three dimensions with basis 1 -forms d x , d y d x , d y dx,dy\boldsymbol{d} x, \boldsymbol{d} ydx,dy and d z d z dz\boldsymbol{d} zdz, we have the following forms:
0-form Z ~ = f 1-form Y = A d x + B d y + C d z 2-form X ~ = a d x d y + b d y d z + c d z d x 3-form W ~ = F d x d y d z  0-form       Z ~ = f  1-form       Y = A d x + B d y + C d z  2-form       X ~ = a d x d y + b d y d z + c d z d x  3-form       W ~ = F d x d y d z {:[" 0-form ", tilde(Z)=f],[" 1-form ",Y=Adx+Bdy+Cdz],[" 2-form ", tilde(X)=adx^^dy+bdy^^dz+cdz^^dx],[" 3-form ", tilde(W)=Fdx^^dy^^dz]:}\begin{array}{ll} \text { 0-form } & \tilde{\boldsymbol{Z}}=f \\ \text { 1-form } & \boldsymbol{\boldsymbol { Y }}=A \boldsymbol{d} x+B \boldsymbol{d} y+C \boldsymbol{d} z \\ \text { 2-form } & \tilde{\boldsymbol{X}}=a \boldsymbol{d} x \wedge \boldsymbol{d} y+b \boldsymbol{d} y \wedge \boldsymbol{d} z+c \boldsymbol{d} z \wedge \boldsymbol{d} x \\ \text { 3-form } & \tilde{\boldsymbol{W}}=F \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \end{array} 0-form Z~=f 1-form Y=Adx+Bdy+Cdz 2-form X~=adxdy+bdydz+cdzdx 3-form W~=Fdxdydz
We can operate with the exterior derivative operator on the 1-form to make
d Y ~ = A y d y d x + A z d z d x + B x d x d y + B z d z d y + C x d x d z + C y d y d z = ( C y B z ) d y d z + ( A z C x ) d z d x (38.40) + ( B x A y ) d x d y d Y ~ = A y d y d x + A z d z d x + B x d x d y + B z d z d y + C x d x d z + C y d y d z = C y B z d y d z + A z C x d z d x (38.40) + B x A y d x d y {:[d tilde(Y)=(del A)/(del y)dy^^dx+(del A)/(del z)dz^^dx],[+(del B)/(del x)dx^^dy+(del B)/(del z)dz^^dy],[+(del C)/(del x)dx^^dz+(del C)/(del y)dy^^dz],[=((del C)/(del y)-(del B)/(del z))dy^^dz+((del A)/(del z)-(del C)/(del x))dz^^dx],[(38.40)+((del B)/(del x)-(del A)/(del y))dx^^dy]:}\begin{align*} \boldsymbol{d} \tilde{\boldsymbol{Y}} & =\frac{\partial A}{\partial y} \boldsymbol{d} y \wedge \boldsymbol{d} x+\frac{\partial A}{\partial z} \boldsymbol{d} z \wedge \boldsymbol{d} x \\ & +\frac{\partial B}{\partial x} \boldsymbol{d} x \wedge \boldsymbol{d} y+\frac{\partial B}{\partial z} \boldsymbol{d} z \wedge \boldsymbol{d} y \\ & +\frac{\partial C}{\partial x} \boldsymbol{d} x \wedge \boldsymbol{d} z+\frac{\partial C}{\partial y} \boldsymbol{d} y \wedge \boldsymbol{d} z \\ = & \left(\frac{\partial C}{\partial y}-\frac{\partial B}{\partial z}\right) \boldsymbol{d} y \wedge \boldsymbol{d} z+\left(\frac{\partial A}{\partial z}-\frac{\partial C}{\partial x}\right) \boldsymbol{d} z \wedge \boldsymbol{d} x \\ & +\left(\frac{\partial B}{\partial x}-\frac{\partial A}{\partial y}\right) \boldsymbol{d} x \wedge \boldsymbol{d} y \tag{38.40} \end{align*}dY~=Aydydx+Azdzdx+Bxdxdy+Bzdzdy+Cxdxdz+Cydydz=(CyBz)dydz+(AzCx)dzdx(38.40)+(BxAy)dxdy
However, if we operate a second time, we find d d Y ~ = 0 d d Y ~ = 0 dd tilde(Y)=0\boldsymbol{d} \boldsymbol{d} \tilde{\boldsymbol{Y}}=0ddY~=0. This property that two operations of the exterior derivative operator yields zero is a general one. Trying again by starting on the 2 -form, we find
(38.41) d X ~ = ( a z + b x + c y ) d x d y d z (38.41) d X ~ = a z + b x + c y d x d y d z {:(38.41)d tilde(X)=((del a)/(del z)+(del b)/(del x)+(del c)/(del y))dx^^dy^^dz:}\begin{equation*} \boldsymbol{d} \tilde{\boldsymbol{X}}=\left(\frac{\partial a}{\partial z}+\frac{\partial b}{\partial x}+\frac{\partial c}{\partial y}\right) \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \tag{38.41} \end{equation*}(38.41)dX~=(az+bx+cy)dxdydz
but then d d X ~ = 0 d d X ~ = 0 dd tilde(X)=0\boldsymbol{d d} \tilde{\boldsymbol{X}}=0ddX~=0.
Each of the results from the last example are summarized with the expression, familiar from Chapter 33 that
(38.42) d d = 0 . (38.42) d d = 0 . {:(38.42)dd=0.:}\begin{equation*} d d=0 . \tag{38.42} \end{equation*}(38.42)dd=0.
16 16 ^(16){ }^{16}16 An n n nnn-form A ~ A ~ tilde(A)\tilde{\boldsymbol{A}}A~ is closed if d A ~ = 0 d A ~ = 0 d tilde(A)=0d \tilde{\boldsymbol{A}}=0dA~=0 An n n nnn-form is called exact if it is the derivative of an ( n 1 ) ( n 1 ) (n-1)(n-1)(n1)-form. All exact forms are clearly closed. Are all closed forms exact? That is, is the property that the exterior derivative gives zero enough to guarantee that we can write the form as the exterior derivative of another object. The Poincare lemma says that if d A ~ = 0 d A ~ = 0 d tilde(A)=0\boldsymbol{d} \tilde{\boldsymbol{A}}=0dA~=0 throughout a simply connected region of space, then A ~ = d B ~ A ~ = d B ~ tilde(A)=d tilde(B)\tilde{\boldsymbol{A}}=\boldsymbol{d} \tilde{B}A~=dB~ for some B ~ B ~ tilde(B)\tilde{B}B~ and so the form is exact. A space that is not simply connected is discussed in the exercises.
17 17 ^(17){ }^{17}17 This fussy definition really just means we need to define a consistent positive sense of pointing away from the volume C C CCC.
18 18 ^(18){ }^{18}18 Stokes' theorem is proved in Needham (2021) and, at a more advanced level, in Spivak's Calculus on Manifolds, both of which are highly recommended. Of Stokes' theorem, Spivak notes: '1. It is trivial. 2. It is trivial because the terms appearing in it have been properly defined. 3 . It has significant consequences.'
19 19 ^(19){ }^{19}19 Recall that in the coordinate frame, we need a factor [ ( 1 ) s det g μ ν ] 1 2 ( 1 ) s det g μ ν 1 2 [(-1)^(s)detg_(mu nu)]^((1)/(2))\left[(-1)^{s} \operatorname{det} g_{\mu \nu}\right]^{\frac{1}{2}}[(1)sdetgμν]12, where s s sss is the number of minuses in the metric signature. We'll assume here that, since we're looking at threedimensional space, s = 0 s = 0 s=0s=0s=0 and write det g μ ν = g det g μ ν = g detg_(mu nu)=g\operatorname{det} g_{\mu \nu}=gdetgμν=g.
This equation is the dual equation of 2 = 0 . 16 2 = 0 . 16 del^(2)=0.^(16)\partial^{2}=0 .{ }^{16}2=0.16
With the operators d d d\boldsymbol{d}d and del\partial in hand, we can write down Stokes' theorem. The content of the theorem reflects the ability to use the operators to equate the inner product of an ( n + 1 ) ( n + 1 ) (n+1)(n+1)(n+1)-chain C C CCC and ( n + 1 ) ( n + 1 ) (n+1)(n+1)(n+1)-form d Ω d Ω d Omega\boldsymbol{d} \boldsymbol{\Omega}dΩ, with that of an n n nnn-chain C C del C\partial CC and n n nnn-form Ω ~ Ω ~ tilde(Omega)\tilde{\boldsymbol{\Omega}}Ω~.
Stokes' theorem says that if Ω ~ Ω ~ tilde(Omega)\tilde{\boldsymbol{\Omega}}Ω~ is a n n nnn-form and C C CCC is a ( n + 1 ) ( n + 1 ) (n+1)(n+1)(n+1)-chain, then we have
(38.43) C Ω ~ = C d Ω ~ (38.43) C Ω ~ = C d Ω ~ {:(38.43)int_(del C) tilde(Omega)=int_(C)d tilde(Omega):}\begin{equation*} \int_{\partial C} \tilde{\Omega}=\int_{C} d \tilde{\Omega} \tag{38.43} \end{equation*}(38.43)CΩ~=CdΩ~
where the orientation of the surface C C del C\partial CC must be chosen 17 17 ^(17){ }^{17}17 such that if the orientation for C C CCC is
(38.44) P λ 1 P λ 2 P λ n (38.44) P λ 1 P λ 2 P λ n {:(38.44)(delP)/(dellambda^(1))^^(delP)/(dellambda^(2))^^dots^^(delP)/(dellambda^(n)):}\begin{equation*} \frac{\partial \mathcal{P}}{\partial \lambda^{1}} \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{2}} \wedge \ldots \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{n}} \tag{38.44} \end{equation*}(38.44)Pλ1Pλ2Pλn
then the orientation for C C del C\partial CC is
(38.45) P λ 2 P λ n (38.45) P λ 2 P λ n {:(38.45)(delP)/(dellambda^(2))^^dots^^(delP)/(dellambda^(n)):}\begin{equation*} \frac{\partial \mathcal{P}}{\partial \lambda^{2}} \wedge \ldots \wedge \frac{\partial \mathcal{P}}{\partial \lambda^{n}} \tag{38.45} \end{equation*}(38.45)Pλ2Pλn
We shall not prove Stokes' theorem here, 18 18 ^(18){ }^{18}18 instead, we put the theorem to work in the examples below. We warm up with the computation of some volume elements.

Example 38.6

Working in a coordinate frame in three-dimensional space with coordinate system ( x , y , z ) ( x , y , z ) (x,y,z)(x, y, z)(x,y,z) and a metric g g g\boldsymbol{g}g, we want to integrate a function f ( x , y , x ) f ( x , y , x ) f(x,y,x)f(x, y, x)f(x,y,x) over a 3 -volume V V VVV. The volume 3 -form 19 19 ^(19){ }^{19}19 for the space is given by
(38.46) ω ~ ( , , ) = g d x ( ) d y ( ) d z ( ) (38.46) ω ~ ( , , ) = g d x ( ) d y ( ) d z ( ) {:(38.46) tilde(omega)(","",")=sqrtgdx()^^dy()^^dz():}\begin{equation*} \tilde{\boldsymbol{\omega}}(,,)=\sqrt{g} \boldsymbol{d} x() \wedge \boldsymbol{d} y() \wedge \boldsymbol{d} z() \tag{38.46} \end{equation*}(38.46)ω~(,,)=gdx()dy()dz()
The chain C 3 = V C 3 = V C_(3)=VC_{3}=VC3=V tells us the size of the volume over which to integrate, using an integration volume element d V d V dV\mathrm{d} VdV. The latter is a box with sides d x e x , d y e y d x e x , d y e y dxe_(x),dye_(y)\mathrm{d} x e_{x}, \mathrm{~d} y e_{y}dxex, dyey and d z e z d z e z dze_(z)\mathrm{d} z e_{z}dzez, such that we have d V = ω ~ ( d x e x , d y e y , d z e z ) = g d x d y d z d V = ω ~ d x e x , d y e y , d z e z = g d x d y d z dV= tilde(omega)(dxe_(x),(d)ye_(y),(d)ze_(z))=sqrtgdxdydz\mathrm{d} V=\tilde{\boldsymbol{\omega}}\left(\mathrm{d} x \boldsymbol{e}_{x}, \mathrm{~d} y \boldsymbol{e}_{y}, \mathrm{~d} z \boldsymbol{e}_{z}\right)=\sqrt{g} \mathrm{~d} x \mathrm{~d} y \mathrm{~d} zdV=ω~(dxex, dyey, dzez)=g dx dy dz. A volume integral is then written as
(38.47) C 3 f ω ~ = C 3 f g d x d y d z = V f ( x , y , z ) g d x d y d z (38.47) C 3 f ω ~ = C 3 f g d x d y d z = V f ( x , y , z ) g d x d y d z {:(38.47)int_(C_(3))f tilde(omega)=int_(C_(3))fsqrtgdx^^dy^^dz=int_(V)f(x","y","z)sqrtgdxdydz:}\begin{equation*} \int_{C_{3}} f \tilde{\boldsymbol{\omega}}=\int_{C_{3}} f \sqrt{g} \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z=\int_{V} f(x, y, z) \sqrt{g} \mathrm{~d} x \mathrm{~d} y \mathrm{~d} z \tag{38.47} \end{equation*}(38.47)C3fω~=C3fgdxdydz=Vf(x,y,z)g dx dy dz
In flat space, of course, we have g = 1 g = 1 g=1g=1g=1.

Example 38.7

Let's integrate a function f ( x , y , z ) f ( x , y , z ) f(x,y,z)f(x, y, z)f(x,y,z) over an area S S SSS in flat space. 20 20 ^(20){ }^{20}20 One way to construct the integration surface element is to identify an element spanned by vectors d u d u du\mathrm{d} \boldsymbol{u}du and d v d v dv\mathrm{d} \boldsymbol{v}dv with unit outward normal n ^ n ^ hat(n)\hat{\boldsymbol{n}}n^ and write as
(38.48) ω ~ ( d u , d v , n ^ ) = n ^ d u × d v = n ^ d S (38.48) ω ~ ( d u , d v , n ^ ) = n ^ d u × d v = n ^ d S {:(38.48) tilde(omega)(du","dv"," hat(n))= hat(n)*du xxdv= hat(n)*dS:}\begin{equation*} \tilde{\boldsymbol{\omega}}(\mathrm{d} \boldsymbol{u}, \mathrm{~d} \boldsymbol{v}, \hat{\boldsymbol{n}})=\hat{\boldsymbol{n}} \cdot \mathrm{d} \boldsymbol{u} \times \mathrm{d} \boldsymbol{v}=\hat{\boldsymbol{n}} \cdot \mathrm{d} \boldsymbol{S} \tag{38.48} \end{equation*}(38.48)ω~(du, dv,n^)=n^du×dv=n^dS
where d S = d u × d v d S = d u × d v dS=du xxdv\mathrm{d} \boldsymbol{S}=\mathrm{d} \boldsymbol{u} \times \mathrm{d} \boldsymbol{v}dS=du×dv. We spot that the expression in eqn 38.48 is equivalent to the 2 -form
(38.49) Σ ~ ( , ) = n ^ x d y d z + n ^ y d z d x + n ^ z d x d y (38.49) Σ ~ ( , ) = n ^ x d y d z + n ^ y d z d x + n ^ z d x d y {:(38.49) tilde(Sigma)(",")= hat(n)^(x)dy^^dz+ hat(n)^(y)dz^^dx+ hat(n)^(z)dx^^dy:}\begin{equation*} \tilde{\mathbf{\Sigma}}(,)=\hat{n}^{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+\hat{n}^{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+\hat{n}^{z} \boldsymbol{d} x \wedge \boldsymbol{d} y \tag{38.49} \end{equation*}(38.49)Σ~(,)=n^xdydz+n^ydzdx+n^zdxdy
when its slots are filled with d u d u du\mathrm{d} \boldsymbol{u}du and d v d v dv\mathrm{d} \boldsymbol{v}dv. We can therefore perform the surface integral
(38.50) C 2 f Σ ~ = S f ( x , y , z ) n ^ d S (38.50) C 2 f Σ ~ = S f ( x , y , z ) n ^ d S {:(38.50)int_(C_(2))f tilde(Sigma)=int_(S)f(x","y","z) hat(n)*dS:}\begin{equation*} \int_{C_{2}} f \tilde{\boldsymbol{\Sigma}}=\int_{S} f(x, y, z) \hat{\boldsymbol{n}} \cdot \mathrm{d} \boldsymbol{S} \tag{38.50} \end{equation*}(38.50)C2fΣ~=Sf(x,y,z)n^dS
Example 38.8
We can now use Stokes' theorem to see how the flat-space integral theorems of vector calculus can be described in the new language by considering a function f f fff (i.e. a 0 -form) integrated over the geometries shown in Fig. 38.2. In Fig. 38.2(a), we have
(38.51) γ d f = f [ P ( B ) ] f [ P ( A ) ] , (38.51) γ d f = f [ P ( B ) ] f [ P ( A ) ] , {:(38.51)int_(gamma)df=f[P(B)]-f[P(A)]",":}\begin{equation*} \int_{\gamma} \boldsymbol{d} f=f[\mathcal{P}(B)]-f[\mathcal{P}(A)], \tag{38.51} \end{equation*}(38.51)γdf=f[P(B)]f[P(A)],
as we saw in Example 12. Similarly, for Fig. 38.2(b) we have
(38.5) S d f d x i = Γ f d x i (38.5) S d f d x i = Γ f d x i {:(38.5)int_(S)df^^dx^(i)=int_(Gamma)fdx^(i):}\begin{equation*} \int_{S} \boldsymbol{d} f \wedge \boldsymbol{d} x^{i}=\int_{\Gamma} f \boldsymbol{d} x^{i} \tag{38.5} \end{equation*}(38.5)Sdfdxi=Γfdxi
and for the volume V V VVV in Fig. 38.2(c) we obtain
(38.53) V d f d x i d x j = Σ f d x i d x j (38.53) V d f d x i d x j = Σ f d x i d x j {:(38.53)int_(V)df^^dx^(i)^^dx^(j)=int_(Sigma)fdx^(i)^^dx^(j):}\begin{equation*} \int_{V} \boldsymbol{d} f \wedge \boldsymbol{d} x^{i} \wedge \boldsymbol{d} x^{j}=\int_{\Sigma} f \boldsymbol{d} x^{i} \wedge \boldsymbol{d} x^{j} \tag{38.53} \end{equation*}(38.53)Vdfdxidxj=Σfdxidxj
To investigate this final geometry in more detail, consider the 2-form
(38.54) A ~ = A x d y d z + A y d z d x + A z d x d y (38.54) A ~ = A x d y d z + A y d z d x + A z d x d y {:(38.54) tilde(A)=A_(x)dy^^dz+A_(y)dz^^dx+A_(z)dx^^dy:}\begin{equation*} \tilde{\boldsymbol{A}}=A_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+A_{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+A_{z} \boldsymbol{d} x \wedge \boldsymbol{d} y \tag{38.54} \end{equation*}(38.54)A~=Axdydz+Aydzdx+Azdxdy
and the 3-chain C 3 = V C 3 = V C_(3)=VC_{3}=VC3=V, a volume in flat space, with boundary V = Σ V = Σ del V=Sigma\partial V=\SigmaV=Σ. We then have, by Stokes' theorem
Σ A x d y d z + A y d z d x + A z d x (38.55) d y = V ( A x x + A y y + A z z ) d x d y d z Σ A x d y d z + A y d z d x + A z d x (38.55) d y = V A x x + A y y + A z z d x d y d z {:[int_(Sigma)A_(x)dy^^dz+A_(y)dz^^dx+A_(z)dx],[(38.55)^^dy],[=int_(V)((delA_(x))/(del x)+(delA_(y))/(del y)+(delA_(z))/(del z))dx^^dy^^dz]:}\begin{align*} & \int_{\Sigma} A_{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+A_{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+A_{z} \boldsymbol{d} x \\ & \wedge \boldsymbol{d} y \tag{38.55}\\ & =\int_{V}\left(\frac{\partial A_{x}}{\partial x}+\frac{\partial A_{y}}{\partial y}+\frac{\partial A_{z}}{\partial z}\right) \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \end{align*}ΣAxdydz+Aydzdx+Azdx(38.55)dy=V(Axx+Ayy+Azz)dxdydz
In ordinary Cartesian coordinates, this is equivalent to
(38.56) Σ A x d y d z + A y d z d x + A z d x d y = V A d x d y d z (38.56) Σ A x d y d z + A y d z d x + A z d x d y = V A d x d y d z {:(38.56)int_(Sigma)A_(x)dydz+A_(y)dzdx+A_(z)dxdy=int_(V) vec(grad)* vec(A)dxdydz:}\begin{equation*} \int_{\Sigma} A_{x} \mathrm{~d} y \mathrm{~d} z+A_{y} \mathrm{~d} z \mathrm{~d} x+A_{z} \mathrm{~d} x \mathrm{~d} y=\int_{V} \vec{\nabla} \cdot \vec{A} \mathrm{~d} x \mathrm{~d} y \mathrm{~d} z \tag{38.56} \end{equation*}(38.56)ΣAx dy dz+Ay dz dx+Az dx dy=VA dx dy dz
or, in terms of the element d Σ d Σ d vec(Sigma)\mathrm{d} \vec{\Sigma}dΣ of the surface Σ Σ Sigma\SigmaΣ and element d V d V dV\mathrm{d} VdV of the volume V V VVV,
(38.57) Σ A d Σ = V A d V (38.57) Σ A d Σ = V A d V {:(38.57)int_(Sigma) vec(A)*d vec(Sigma)=int_(V) vec(grad)* vec(A)dV:}\begin{equation*} \int_{\Sigma} \vec{A} \cdot \mathrm{~d} \vec{\Sigma}=\int_{V} \vec{\nabla} \cdot \vec{A} \mathrm{~d} V \tag{38.57} \end{equation*}(38.57)ΣA dΣ=VA dV
which is the divergence theorem
Stokes' theorem is an essential part of our geometrical description of the conservation of charge in electromagnetism, and also of the constraints on curvature caused by the geometry of spacetime. These topics will be introduced as we take a closer look at the physics of fields, which forms the final part of the book.

(b)

(c)
Fig. 38.2 Geometries for the theorems of integral calculus.

Chapter summary

  • All integrals can be expressed as an integral over a form. The integral expresses the inner product of the n n nnn-form and the n n nnn-vector that represents the surface over which the integral is carried out.
  • The general surfaces over which we integrate are chains. The boundaries of chains are found with the boundary operator.
  • = 0 = 0 del del=0\partial \partial=0=0 expresses the fact that a boundary of a boundary is zero.
  • Stokes' theorem says that
(38.58) C Ω ~ = C d Ω ~ (38.58) C Ω ~ = C d Ω ~ {:(38.58)int_(del C) tilde(Omega)=int_(C)d tilde(Omega):}\begin{equation*} \int_{\partial C} \tilde{\boldsymbol{\Omega}}=\int_{C} d \tilde{\boldsymbol{\Omega}} \tag{38.58} \end{equation*}(38.58)CΩ~=CdΩ~

Exercises

(38.1) Consider the 1 -form A ~ = A i d x i A ~ = A i d x i tilde(A)=A_(i)dx^(i)\tilde{\boldsymbol{A}}=A_{i} \boldsymbol{d} x^{i}A~=Aidxi in flat, threedimensional space. Use Stokes' theorem to prove the Kelvin-Stokes theorem
(38.59) S ( × A ) d S = S A d l . (38.59) S ( × A ) d S = S A d l . {:(38.59)int_(S)( vec(grad)xx vec(A))*d vec(S)=oint_(del S) vec(A)*d vec(l).:}\begin{equation*} \int_{S}(\vec{\nabla} \times \vec{A}) \cdot \mathrm{d} \vec{S}=\oint_{\partial S} \vec{A} \cdot \mathrm{~d} \vec{l} . \tag{38.59} \end{equation*}(38.59)S(×A)dS=SA dl.
(38.2) The ball S 2 S 2 S^(2)S^{2}S2 has no boundary.
(a) Prove that if a 2 -form Ω ~ Ω ~ tilde(Omega)\tilde{\boldsymbol{\Omega}}Ω~ is exact on S 2 S 2 S^(2)S^{2}S2, we nust have
(38.60) S 2 Ω ~ = 0 (38.60) S 2 Ω ~ = 0 {:(38.60)int_(S^(2)) tilde(Omega)=0:}\begin{equation*} \int_{S^{2}} \tilde{\boldsymbol{\Omega}}=0 \tag{38.60} \end{equation*}(38.60)S2Ω~=0
(b) Show that the 2-form G ~ = x 1 d x 2 d x 3 G ~ = x 1 d x 2 d x 3 tilde(G)=x^(1)dx^(2)^^dx^(3)\tilde{\boldsymbol{G}}=x^{1} \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}G~=x1dx2dx3 defined in three-dimensional space, is not exact on S 2 S 2 S^(2)S^{2}S2.
(38.3) Consider the vortex 1-form field
(38.61) ϕ ~ = A r 2 ( y d x + x d y ) , (38.61) ϕ ~ = A r 2 ( y d x + x d y ) , {:(38.61) tilde(phi)=(A)/(r^(2))(-ydx+xdy)",":}\begin{equation*} \tilde{\phi}=\frac{A}{r^{2}}(-y \boldsymbol{d} x+x \boldsymbol{d} y), \tag{38.61} \end{equation*}(38.61)ϕ~=Ar2(ydx+xdy),
where r 2 = x 2 + y 2 r 2 = x 2 + y 2 r^(2)=x^(2)+y^(2)r^{2}=x^{2}+y^{2}r2=x2+y2 and A A AAA is a constant. This form is not exact since ϕ ~ d f ϕ ~ d f tilde(phi)!=df\tilde{\phi} \neq \boldsymbol{d} fϕ~df, for any function f f fff. Show, however, that the vortex form is closed, with d ϕ = 0 d ϕ = 0 d phi=0\boldsymbol{d} \phi=0dϕ=0.
This is an example of a closed form that is not exact. Although we appear to be working in Euclidean 2-space (which is simply connected) where we expect closed forms to be exact, the field is actually singular at the origin. As a result, we must remove this troublesome point. We are not therefore really dealing with a simply connected space and so Poincaré's lemma does not apply.
(38.4) (a) By computing connection coefficients in the orthonormal frame, show that the geodesic equation on the surface of a unit sphere can be written as
(38.62) ( x ¨ 1 ^ x ¨ 2 ^ ) = ϕ ˙ cos θ ( 0 1 1 0 ) ( x ˙ 1 ^ x ˙ 2 ^ ) (38.62) ( x ¨ 1 ^ x ¨ 2 ^ ) = ϕ ˙ cos θ 0 1 1 0 ( x ˙ 1 ^ x ˙ 2 ^ ) {:(38.62)((x^(¨)^( hat(1)))/(x^(¨)^( hat(2))))=-phi^(˙)cos theta([0,-1],[1,0])((x^(˙)^( hat(1)))/(x^(˙)^( hat(2)))):}\binom{\ddot{x}^{\hat{1}}}{\ddot{x}^{\hat{2}}}=-\dot{\phi} \cos \theta\left(\begin{array}{cc} 0 & -1 \tag{38.62}\\ 1 & 0 \end{array}\right)\binom{\dot{x}^{\hat{1}}}{\dot{x}^{\hat{2}}}(38.62)(x¨1^x¨2^)=ϕ˙cosθ(0110)(x˙1^x˙2^)
where ( x 1 , x 2 ) = ( θ , ϕ ) x 1 , x 2 = ( θ , ϕ ) (x^(1),x^(2))=(theta,phi)\left(x^{1}, x^{2}\right)=(\theta, \phi)(x1,x2)=(θ,ϕ) and dots denote derivatives with respect to the proper time τ τ tau\tauτ.
This equation can be solved by a velocity of the form u ( t ) = e A ( t ) u ( 0 ) u ( t ) = e A ( t ) u ( 0 ) u(t)=e^(A(t))u(0)\boldsymbol{u}(t)=\mathrm{e}^{\boldsymbol{A}(t)} \boldsymbol{u}(0)u(t)=eA(t)u(0) where, for our chosen basis,
A ( t ) = 0 t d τ cos θ ( τ ) ϕ ˙ ( τ ) ( 0 1 1 0 ) A _ ( t ) = 0 t d τ cos θ ( τ ) ϕ ˙ ( τ ) 0 1 1 0 A_(t)=-int_(0)^(t)dtau cos theta(tau)phi^(˙)(tau)([0,-1],[1,0])\underline{\boldsymbol{A}}(t)=-\int_{0}^{t} \mathrm{~d} \tau \cos \theta(\tau) \dot{\phi}(\tau)\left(\begin{array}{cc} 0 & -1 \\ 1 & 0 \end{array}\right)A(t)=0t dτcosθ(τ)ϕ˙(τ)(0110)
(b) Use Stokes' theorem to verify Green's theorem in the plane
(38.64) γ ( L d x + M d y ) = S d x d y ( M x L y ) (38.64) γ ( L d x + M d y ) = S d x d y M x L y {:(38.64)oint_(gamma)(Ldx+Mdy)=int_(S)dxdy((del M)/(del x)-(del L)/(del y)):}\begin{equation*} \oint_{\gamma}(L \mathrm{~d} x+M \mathrm{~d} y)=\int_{S} \mathrm{~d} x \mathrm{~d} y\left(\frac{\partial M}{\partial x}-\frac{\partial L}{\partial y}\right) \tag{38.64} \end{equation*}(38.64)γ(L dx+M dy)=S dx dy(MxLy)
(c) Use Green's theorem in the plane to show that if the motion on the sphere forms a loop after a time T T TTT, then the integral I I I\mathcal{I}I gives
(38.65) I = 0 T d τ cos θ ( τ ) ϕ ˙ ( τ ) = A (38.65) I = 0 T d τ cos θ ( τ ) ϕ ˙ ( τ ) = A {:(38.65)I=int_(0)^(T)dtau cos theta(tau)phi^(˙)(tau)=-A:}\begin{equation*} \mathcal{I}=\int_{0}^{T} \mathrm{~d} \tau \cos \theta(\tau) \dot{\phi}(\tau)=-\mathcal{A} \tag{38.65} \end{equation*}(38.65)I=0T dτcosθ(τ)ϕ˙(τ)=A
where A A A\mathcal{A}A is the area enclosed by the loop.
(d) Hence, show that if the vector u u u\boldsymbol{u}u is paralle
transported around a closed loop, the result can be described by the operation R u R _ u R_u\underline{\boldsymbol{R}} \boldsymbol{u}Ru, where R R _ R_\underline{\boldsymbol{R}}R is a matrix corresponding to rotation by an angle A A A\mathcal{A}A. See Blennow and Ohlsson, whose approach we have
followed here, for a more complete discussion of this problem. The result of this problem is a useful one in magnetism for computing geometric phases.

Part VI

Classical and quantum fields

Having sharpened our geometrical tools in Part V, we now apply these ideas to a geometrical description of the physics of general relativity. Our goal is to understand the physics of relativity from both the point of view of differential geometry and also from that of classical field theory. 'Classical' here means pre-quantum, but including relativity. The extension to quantum physics is our focus towards the end of this part of the book.
  • In Chapter 39, we examine the physics of fluids, including their equations of motion and thermodynamics.
  • In Chapter 40, we approach the Einstein equations from the point of view of classical field theory where using the idea of a Lagrangian density we can see how general relativity fits into the framework of field theory. We apply this to the problem of the early Universe in Chapter 41.
  • In Chapter 42, we formulate a geometrical description of electromagnetism. This is developed in Chapter 43, where we use differential geometry to understand the conservation of charge.
  • In Chapter 44, we examine the link between relativity and the very influential idea of a gauge in a field theory.
  • In Chapter 45, we examine gravitation in the domain where it is usually encountered: the limit of small field. This enables us to discuss gravitational radiation in Chapter 46.
  • In Chapters 47-49, we investigate how quantum mechanics might be linked with gravitation, and how having extra dimensions available might enable this.
  • Finally, in Chapter 50, we show the inevitability of a Big Bang from our geometrical viewpoint and its reliance on smooth spacetimes.

Fluids as dry water

39.1 Euler's equation 413
39.2 Energy and Bernoulli's equa
39.3 Energy-momentum tensor
39.4 Relativistic fluids 420
Chapter summary 425
Exercises 425
\curvearrowright The material in this chapter fills in some background on how fluids are treated in physics. It can be skipped on a first reading.
1 1 ^(1){ }^{1}1 This approach is adopted in Vol. II, Lecture 40 of the Feynman Lectures in Physics, R.P. Feynman, R. Leighton and M. Sands. In the lecture after this, Feynman examines some of the consequences of including viscosity, and the interested reader is advised to look there in the first instance.
2 2 ^(2){ }^{2}2 We shall distinguish the mass density ρ 0 ρ 0 rho_(0)\rho_{0}ρ0 of the fluid and the energy density ρ c 2 ρ c 2 rhoc^(2)\rho c^{2}ρc2, which includes a contribution from ρ 0 c 2 ρ 0 c 2 rho_(0)c^(2)\rho_{0} c^{2}ρ0c2 and also sources of internal energy
Fig. 39.1 Flow lines form a congruence of curves. The tangents give the velocity field.
He who drinks a tumbler of water in London has literally in his stomach more animated beings than there are men, women and children on the face of the globe.
Sydney Smith (1771-1845)
A particularly satisfying classical field theory is that of the fluid. In this chapter, we start by examining a non-relativistic fluid in flat spacetime, and formulate mass conservation and an equation of motion, and then look at the consequences of energy conservation. We then turn to the problem of the relativistic fluid. At the risk of being perverse, we shall ignore exactly that feature that gives fluids their characteristic wetness: viscosity. This omission is made because the fluids we describe in the context of relativity are well described as inviscid (i.e. viscosityfree). This approach 1 1 ^(1){ }^{1}1 also has the attractive property of making the subject as straightforward as possible.
A fluid is matter that continuously deforms if it is subject to a shear stress. In modelling a fluid, we identify elements of the fluid, initially at some coordinate x μ = ( t , x ) x μ = ( t , x ) x^(mu)=(t, vec(x))x^{\mu}=(t, \vec{x})xμ=(t,x), that we expect to move around. This movement is called a flow. To describe a fluid, we specify its mass density, 2 2 ^(2){ }^{2}2 described by a scalar field ρ 0 ( t , x ) ρ 0 ( t , x ) rho_(0)(t, vec(x))\rho_{0}(t, \vec{x})ρ0(t,x) and its velocity, which is described by a vector field v ( t , x ) v ( t , x ) vec(v)(t, vec(x))\vec{v}(t, \vec{x})v(t,x).
The velocity is best thought of as being a tangent field. The tangents in question are tangents to a congruence of curves. As usual, the congruence fills all of the space, so that there is a single tangent at a particular point. This tangent is identical to the velocity vector field evaluated at that point (Fig. 39.1). The congruent curves are known as the streamlines of the fluid. (They are also known as the flow lines, and they do indeed describe the flow of the fluid.) Defined in this way, the flow lines tell us about the velocities of the elements of the fluid, rather than their trajectories. Flow lines are only identical to the trajectories of the fluid elements in the case of steady flow of the fluid.
We want to ensure that the mass of the fluid is locally conserved, which is to say that (in the absence of sources and sinks of mass), the change of fluid mass in a particular volume is accounted for by the flow of mass into or out of that volume. We define the fluid's mass current vector J J J\boldsymbol{J}J, which has components J μ = ( ρ 0 , ρ 0 v ) J μ = ρ 0 , ρ 0 v J^(mu)=(rho_(0),rho_(0)( vec(v)))J^{\mu}=\left(\rho_{0}, \rho_{0} \vec{v}\right)Jμ=(ρ0,ρ0v). Conservation of mass is then written as J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0, or in components
(39.1) ρ 0 t + ( ρ 0 v ) = 0 (39.1) ρ 0 t + ρ 0 v = 0 {:(39.1)(delrho_(0))/(del t)+ vec(grad)*(rho_(0)( vec(v)))=0:}\begin{equation*} \frac{\partial \rho_{0}}{\partial t}+\vec{\nabla} \cdot\left(\rho_{0} \vec{v}\right)=0 \tag{39.1} \end{equation*}(39.1)ρ0t+(ρ0v)=0
Finally, a useful quantity known as the vorticity of the fluid is defined as
(39.2) ω = × v . (39.2) ω = × v . {:(39.2) vec(omega)= vec(grad)xx vec(v).:}\begin{equation*} \vec{\omega}=\vec{\nabla} \times \vec{v} . \tag{39.2} \end{equation*}(39.2)ω=×v.
A fluid with ω = 0 ω = 0 vec(omega)=0\vec{\omega}=0ω=0 is called irrotational.

39.1 Euler's equation

We would like to identify an equation of motion for the fluid. We shall do this non-relativistically by using Newton's second law. If a volume of fluid has a pressure p ( t , x ) p ( t , x ) p(t, vec(x))p(t, \vec{x})p(t,x) then we have access to the local force on an element of fluid
(39.3) F = d S p ( t , x ) = d 3 x p (39.3) F = d S p ( t , x ) = d 3 x p {:(39.3) vec(F)=-ointd vec(S)p(t"," vec(x))=-intd^(3)x vec(grad)p:}\begin{equation*} \vec{F}=-\oint \mathrm{d} \vec{S} p(t, \vec{x})=-\int \mathrm{d}^{3} x \vec{\nabla} p \tag{39.3} \end{equation*}(39.3)F=dSp(t,x)=d3xp
where d S d S d vec(S)\mathrm{d} \vec{S}dS is a directed element of area of the fluid element, and d 3 x d 3 x d^(3)x\mathrm{d}^{3} xd3x is an element of volume (Fig. 39.2). The equation effectively defines the pressure of the fluid in terms of it providing an inward-directed force (hence the minus sign) on an element. 3 3 ^(3){ }^{3}3 The product of mass and acceleration of the fluid is
(39.4) d 3 x ρ 0 d v d t (39.4) d 3 x ρ 0 d v d t {:(39.4)intd^(3)xrho_(0)((d)( vec(v)))/((d)t):}\begin{equation*} \int \mathrm{d}^{3} x \rho_{0} \frac{\mathrm{~d} \vec{v}}{\mathrm{~d} t} \tag{39.4} \end{equation*}(39.4)d3xρ0 dv dt
Equating these two expressions 4 4 ^(4){ }^{4}4 gives an equation of motion
(39.6) ρ 0 d v d t = p (39.6) ρ 0 d v d t = p {:(39.6)rho_(0)((d)( vec(v)))/((d)t)=- vec(grad)p:}\begin{equation*} \rho_{0} \frac{\mathrm{~d} \vec{v}}{\mathrm{~d} t}=-\vec{\nabla} p \tag{39.6} \end{equation*}(39.6)ρ0 dv dt=p
At this point, we need to pause and consider the details of the frames of reference we are using. The velocity field is a function of time and position. The total time derivative d / d t d / d t d//dt\mathrm{d} / \mathrm{d} td/dt we have assumed does not fix the space coordinates like a partial derivative does. 5 5 ^(5){ }^{5}5 The total derivative therefore corresponds to the rate of change of the velocity as observed by an observer who travels along with fluid, so that their spatial coordinate is carried along with the element of fluid under consideration. This is the same situation we had in Chapter 15 where we described the observer as comoving. We conclude that the comoving observer probes the fluid using the total derivative d / d t d / d t d//dt\mathrm{d} / \mathrm{d} td/dt.
Example 39.1
The total derivative describes the rate of change measured by an observer moving locally with the fluid. However, we typically have access to the partial derivative: the rate of change of the fluid from our fixed frame of reference, relative to which the the rate of change of the flud from our fixed frame of reference, relative to which the
flan i i iii th component of a vector field A ( t , x ) A ( t , x ) vec(A)(t, vec(x))\vec{A}(t, \vec{x})A(t,x), we find
d A i = A i t d t + A i x j d x j d A i d t = A i t + A i x j d x j d t (39.7) = A i t + v j A i x j = ( t + v j x j ) A i , d A i = A i t d t + A i x j d x j d A i d t = A i t + A i x j d x j d t (39.7) = A i t + v j A i x j = t + v j x j A i , {:[dA^(i)=(delA^(i))/(del t)*dt+(delA^(i))/(delx^(j))dx^(j)],[((d)A^(i))/((d)t)=(delA^(i))/(del t)+(delA^(i))/(delx^(j))((d)x^(j))/((d)t)],[(39.7)=(delA^(i))/(del t)+v^(j)(delA^(i))/(delx^(j))=((del)/(del t)+v^(j)(del)/(delx^(j)))A^(i)","]:}\begin{align*} \mathrm{d} A^{i} & =\frac{\partial A^{i}}{\partial t} \cdot \mathrm{~d} t+\frac{\partial A^{i}}{\partial x^{j}} \mathrm{~d} x^{j} \\ \frac{\mathrm{~d} A^{i}}{\mathrm{~d} t} & =\frac{\partial A^{i}}{\partial t}+\frac{\partial A^{i}}{\partial x^{j}} \frac{\mathrm{~d} x^{j}}{\mathrm{~d} t} \\ & =\frac{\partial A^{i}}{\partial t}+v^{j} \frac{\partial A^{i}}{\partial x^{j}}=\left(\frac{\partial}{\partial t}+v^{j} \frac{\partial}{\partial x^{j}}\right) A^{i}, \tag{39.7} \end{align*}dAi=Ait dt+Aixj dxj dAi dt=Ait+Aixj dxj dt(39.7)=Ait+vjAixj=(t+vjxj)Ai,
Fig. 39.2 An outwardly directed element of fluid surface at pressure p p ppp used to define the force.
3 3 ^(3){ }^{3}3 The surface integral in eqn 39.3 is taken over the surface of the element of fluid, on which the pressure of the rest of the fluid acts. From the volume integral we see immediately that if there is no gradient in the pressure field, there no gradient in the pressure field, there
is no net force on the element, which makes sense.
4 4 ^(4){ }^{4}4 That is, we find that
d 3 x ( ρ 0 d v d t + p ) = 0 d 3 x ρ 0 d v d t + p = 0 intd^(3)x(rho_(0)((d)( vec(v)))/((d)t)+( vec(grad))p)=0\int \mathrm{d}^{3} x\left(\rho_{0} \frac{\mathrm{~d} \vec{v}}{\mathrm{~d} t}+\vec{\nabla} p\right)=0d3x(ρ0 dv dt+p)=0
and so the integrand must vanish.
5 5 ^(5){ }^{5}5 The partial derivative with respect to time should be interpreted as reading
( t ) fixed position t  fixed   position  ((del)/(del t))_({:[" fixed "],[" position "]:})\left(\frac{\partial}{\partial t}\right)_{\substack{\text { fixed } \\ \text { position }}}(t) fixed  position 
6 6 ^(6){ }^{6}6 We summarize two useful rules for non-relativistic fluid dynamics:
(i) The derivative rule: we use d / d t d / d t d//dt\mathrm{d} / \mathrm{d} td/dt in the comoving frame and / t + v / t + v del//del t+ vec(v)* vec(grad)\partial / \partial t+\vec{v} \cdot \vec{\nabla}/t+v in the stationary frame.
(ii) The divergence rule: the velocity's divergence (a quantity evaluated in the stationary frame) is given in the comoving frame in terms of the rate of change of the volume of a fluid element, viz.
v = 1 V d V d t v = 1 V d V d t vec(grad)* vec(v)=(1)/(V)((d)V)/((d)t)\vec{\nabla} \cdot \vec{v}=\frac{1}{V} \frac{\mathrm{~d} V}{\mathrm{~d} t}v=1V dV dt
where v j = x j / t v j = x j / t v^(j)=delx^(j)//del tv^{j}=\partial x^{j} / \partial tvj=xj/t is a component of the fluid's velocity. This expression provides a useful link between total derivative (appropriate for the comoving observer) and partial derivative (appropriate for the stationary observer).
Using the previous example, we have the useful equation in vector notation, that
(39.8) d A d t = A t + ( v ) A . (39.8) d A d t = A t + ( v ) A . {:(39.8)(d( vec(A)))/((d)t)=(del( vec(A)))/(del t)+( vec(v)* vec(grad)) vec(A).:}\begin{equation*} \frac{\mathrm{d} \vec{A}}{\mathrm{~d} t}=\frac{\partial \vec{A}}{\partial t}+(\vec{v} \cdot \vec{\nabla}) \vec{A} . \tag{39.8} \end{equation*}(39.8)dA dt=At+(v)A.
This allows us to define the convective derivative operator
(39.9) d d t = t + v . (39.9) d d t = t + v . {:(39.9)(d)/((d)t)=(del)/(del t)+ vec(v)* vec(grad).:}\begin{equation*} \frac{\mathrm{d}}{\mathrm{~d} t}=\frac{\partial}{\partial t}+\vec{v} \cdot \vec{\nabla} . \tag{39.9} \end{equation*}(39.9)d dt=t+v.
Using this operator, we can convert our equations between comoving and stationary frames. Notice how the right-hand side of this equation has two contributions: (i) the rate of change of the vector field at a fixed spatial coordinate, added to (ii) the directional derivative (i.e. the derivative directed along the velocity vector v v vec(v)\vec{v}v ). Another way of describing the directional derivative is that v v vec(v)* vec(grad)\vec{v} \cdot \vec{\nabla}v represents a spatial derivative taken along the flow lines.

Example 39.2

Our mass conservation equation J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0 was previously written in terms of partial derivatives and so corresponds to the viewpoint of a stationary observer. We can use the convective derivative equation to shift this into the comoving frame. First expand the conservation equation, and then use the convective derivative to yield
ρ 0 t + ( ρ 0 ) v + ρ 0 v = 0 (expanding) d ρ 0 d t = ρ 0 v (using convective derivative). . = . ρ 0 t + ρ 0 v + ρ 0 v = 0  (expanding)  d ρ 0 d t = ρ 0 v  (using convective derivative).  . = {:(delrho_(0))/(del t)+([{:( vec(grad))rho_(0))* vec(v)+rho_(0) vec(grad)* vec(v)=0," (expanding) "],[(drho_(0))/((d)t)=-rho_(0) vec(grad)* vec(v)," (using convective derivative). "].=:}". ":}\begin{array}{cl} \frac{\partial \rho_{0}}{\partial t}+\left(\begin{array}{ll} \left.\vec{\nabla} \rho_{0}\right) \cdot \vec{v}+\rho_{0} \vec{\nabla} \cdot \vec{v}=0 & \text { (expanding) } \\ \frac{\mathrm{d} \rho_{0}}{\mathrm{~d} t}=-\rho_{0} \vec{\nabla} \cdot \vec{v} & \text { (using convective derivative). } \end{array} .=\right.\text {. } \end{array}ρ0t+(ρ0)v+ρ0v=0 (expanding) dρ0 dt=ρ0v (using convective derivative). .=
In the comoving frame, the observer who follows the fluid is able to identify a fixed mass of fluid M M MMM, which should not vary as a function of time. The mass of fluid instantaneously fills some volume V V VVV and has a density ρ 0 ρ 0 rho_(0)\rho_{0}ρ0 with M = ρ 0 V M = ρ 0 V M=rho_(0)VM=\rho_{0} VM=ρ0V. We have, therefore, that
(39.10) d M d t = d ρ 0 d t V + ρ 0 d V d t = 0 (39.10) d M d t = d ρ 0 d t V + ρ 0 d V d t = 0 {:(39.10)(dM)/((d)t)=(drho_(0))/((d)t)V+rho_(0)((d)V)/((d)t)=0:}\begin{equation*} \frac{\mathrm{d} M}{\mathrm{~d} t}=\frac{\mathrm{d} \rho_{0}}{\mathrm{~d} t} V+\rho_{0} \frac{\mathrm{~d} V}{\mathrm{~d} t}=0 \tag{39.10} \end{equation*}(39.10)dM dt=dρ0 dtV+ρ0 dV dt=0
Combining the previous two exprescions allows us to conclude that the divergence of the velocity is given in terms of the fluid's volume by 6 6 ^(6){ }^{6}6
(39.11) v = 1 V d V d t (39.11) v = 1 V d V d t {:(39.11) vec(grad)* vec(v)=(1)/(V)((d)V)/((d)t):}\begin{equation*} \vec{\nabla} \cdot \vec{v}=\frac{1}{V} \frac{\mathrm{~d} V}{\mathrm{~d} t} \tag{39.11} \end{equation*}(39.11)v=1V dV dt
Expanding the equation of motion (eqn 39.6) in terms of the convective derivative, results in Euler's equation for the motion of the fluid for the stationary observer, which reads
(39.12) v t + ( v ) v = 1 ρ 0 p (39.12) v t + ( v ) v = 1 ρ 0 p {:(39.12)(del( vec(v)))/(del t)+( vec(v)* vec(grad)) vec(v)=-(1)/(rho_(0)) vec(grad)p:}\begin{equation*} \frac{\partial \vec{v}}{\partial t}+(\vec{v} \cdot \vec{\nabla}) \vec{v}=-\frac{1}{\rho_{0}} \vec{\nabla} p \tag{39.12} \end{equation*}(39.12)vt+(v)v=1ρ0p
If gravity is also present then this provides an additional acceleration g g vec(g)\vec{g}g and we write
(39.13) v t + ( v ) v = 1 ρ 0 p + g (39.13) v t + ( v ) v = 1 ρ 0 p + g {:(39.13)(del( vec(v)))/(del t)+( vec(v)* vec(grad)) vec(v)=-(1)/(rho_(0))* vec(grad)p+ vec(g):}\begin{equation*} \frac{\partial \vec{v}}{\partial t}+(\vec{v} \cdot \vec{\nabla}) \vec{v}=-\frac{1}{\rho_{0}} \cdot \vec{\nabla} p+\vec{g} \tag{39.13} \end{equation*}(39.13)vt+(v)v=1ρ0p+g
Example 39.3
Hydrostatics refers to an equilibrium state of affairs where we have a vanishing velocity field, so we then have a hydrostatic equation
(39.14) p = ρ 0 g (hydrostatic fluid). (39.14) p = ρ 0 g  (hydrostatic fluid).  {:(39.14) vec(grad)p=rho_(0) vec(g)quad" (hydrostatic fluid). ":}\begin{equation*} \vec{\nabla} p=\rho_{0} \vec{g} \quad \text { (hydrostatic fluid). } \tag{39.14} \end{equation*}(39.14)p=ρ0g (hydrostatic fluid). 
The existence of a hydrostatic fluid allows us to derive 7 7 ^(7){ }^{7}7 Pascal's law. Expanding the last expression in Cartesian coordinates and assuming gravity acts in the z z -z-zz direction, we have
(39.15) p x = p y = 0 , p z = ρ 0 g (39.15) p x = p y = 0 , p z = ρ 0 g {:(39.15)(del p)/(del x)=(del p)/(del y)=0","quad(del p)/(del z)=-rho_(0)g:}\begin{equation*} \frac{\partial p}{\partial x}=\frac{\partial p}{\partial y}=0, \quad \frac{\partial p}{\partial z}=-\rho_{0} g \tag{39.15} \end{equation*}(39.15)px=py=0,pz=ρ0g
which we integrate, to give
(39.16) p ( z ) = ρ 0 g z + const , (39.16) p ( z ) = ρ 0 g z +  const  , {:(39.16)p(z)=-rho_(0)gz+" const "",":}\begin{equation*} p(z)=-\rho_{0} g z+\text { const }, \tag{39.16} \end{equation*}(39.16)p(z)=ρ0gz+ const ,
which is Pascal's law, giving the variation of pressure with depth in a hydrostatic fluid.
The gravitational field can be written in terms of a gravitational potential g = Φ g = Φ vec(g)=- vec(grad)Phi\vec{g}=-\vec{\nabla} \Phig=Φ. The gravitational potential fits more comfortably into a description of fluids in terms of their energy.
Example 39.4
Consider the cross product of the velocity and the vorticity ( ω = × v ω = × v vec(omega)= vec(grad)xx vec(v)\vec{\omega}=\vec{\nabla} \times \vec{v}ω=×v )
v × ω = v × ( × v ) (39.17) = 1 2 v 2 ( v ) v v × ω = v × ( × v ) (39.17) = 1 2 v 2 ( v ) v {:[ vec(v)xx vec(omega)= vec(v)xx( vec(grad)xx vec(v))],[(39.17)=(1)/(2) vec(grad)v^(2)-( vec(v)* vec(grad)) vec(v)]:}\begin{align*} \vec{v} \times \vec{\omega} & =\vec{v} \times(\vec{\nabla} \times \vec{v}) \\ & =\frac{1}{2} \vec{\nabla} v^{2}-(\vec{v} \cdot \vec{\nabla}) \vec{v} \tag{39.17} \end{align*}v×ω=v×(×v)(39.17)=12v2(v)v
where we have used the rule for a double cross product in the second line. This expression can be used to remove the directional derivative term from Euler's equation (eqn 39.12). We also take the opportunity to replace g g vec(g)\vec{g}g with Φ Φ - vec(grad)Phi-\vec{\nabla} \PhiΦ, with the result
(39.18) v t + ( 1 2 v 2 + Φ ) + p ρ 0 v × ω = 0 (39.18) v t + 1 2 v 2 + Φ + p ρ 0 v × ω = 0 {:(39.18)(del( vec(v)))/(del t)+ vec(grad)((1)/(2)*v^(2)+Phi)+(( vec(grad))p)/(rho_(0))- vec(v)xx vec(omega)=0:}\begin{equation*} \frac{\partial \vec{v}}{\partial t}+\vec{\nabla}\left(\frac{1}{2} \cdot v^{2}+\Phi\right)+\frac{\vec{\nabla} p}{\rho_{0}}-\vec{v} \times \vec{\omega}=0 \tag{39.18} \end{equation*}(39.18)vt+(12v2+Φ)+pρ0v×ω=0
This is a restatement of the Euler equation, but is also the most general form of what is known as Bernoulli's equation. 8 8 ^(8){ }^{8}8 We shall see how this equation arises from energetic considerations in the next section.
The Euler equation gives us an equation of motion for the fluid. However, there's lots of additional insight we can gain into motion if we consider the energetics of the fluid. These considerations are our next topic.

39.2 Energy and Bernoulli's equation

The thermodynamics of the fluid at an equilibrium temperature T T TTT may be investigated using the first law of thermodynamics
(39.19) d U = T d S p d V (39.19) d U = T d S p d V {:(39.19)dU=TdS-pdV:}\begin{equation*} \mathrm{d} U=T \mathrm{~d} S-p \mathrm{~d} V \tag{39.19} \end{equation*}(39.19)dU=T dSp dV
7 7 ^(7){ }^{7}7 Blaise Pascal (1623-1662), mathematician, physicist, philosopher, theologian and inventor, was described by Roberto Rossellini as 'a boring man who never made love in his life'. Rossellini made a well-regarded film about Pascal's life for French television in 1972.
8 8 ^(8){ }^{8}8 Daniel Bernoulli (1700-1782) was the son of Johann Bernoulli (famed for his developments in calculus) and the nephew of Jacob Bernoulli (who founded the study of probability). Daniel's work on fluids was published as Hydrodynamica. Parts of his work were plagiarized by his father and published in Johann's book Hydraulica. Such behaviour is not uncommon in the story of the Bernoulli family.
9 9 ^(9){ }^{9}9 Writing U ( S , V ) U ( S , V ) U(S,V)U(S, V)U(S,V) implies that entropy and volume are the natural variables of U U UUU. These are the quantities that appear as differentials in the first law.
where U ( S , V ) U ( S , V ) U(S,V)U(S, V)U(S,V) is the internal energy and S S SSS is the entropy. 9 9 ^(9){ }^{9}9 Of course,
the first law simply expresses energy conservation. We want to work with the first law in units where the internal energy and the entropy are given per unit mass (these quantities are denoted by lower-case letters). Unit mass implies that volume V V VVV can be replaced with 1 / ρ 0 1 / ρ 0 1//rho_(0)1 / \rho_{0}1/ρ0 and we have a unit-mass version of the first law, written as
(39.20) d u = T d s p d ( 1 ρ 0 ) . (39.20) d u = T d s p d 1 ρ 0 . {:(39.20)du=Tds-pd((1)/(rho_(0))).:}\begin{equation*} \mathrm{d} u=T \mathrm{~d} s-p \mathrm{~d}\left(\frac{1}{\rho_{0}}\right) . \tag{39.20} \end{equation*}(39.20)du=T dsp d(1ρ0).
The natural variables of u u uuu are s s sss and ρ 0 ρ 0 rho_(0)\rho_{0}ρ0. In situations where the natural variables of a problem are s s sss and p p ppp, it is often useful to describe the physics using the enthalpy H ( S , p ) = U + p V H ( S , p ) = U + p V H(S,p)=U+pVH(S, p)=U+p VH(S,p)=U+pV and so we have, in unit mass form,
(39.21) d h = T d s + 1 ρ 0 d p , (39.21) d h = T d s + 1 ρ 0 d p , {:(39.21)dh=Tds+(1)/(rho_(0))dp",":}\begin{equation*} \mathrm{d} h=T \mathrm{~d} s+\frac{1}{\rho_{0}} \mathrm{~d} p, \tag{39.21} \end{equation*}(39.21)dh=T ds+1ρ0 dp,
where h h hhh is enthalpy per unit mass. This implies we can write the enthalpy as h = u + p / ρ 0 h = u + p / ρ 0 h=u+p//rho_(0)h=u+p / \rho_{0}h=u+p/ρ0.
In a perfect fluid, there is no dissipation, and so we must have no entropy production, i.e. d s = 0 d s = 0 ds=0\mathrm{d} s=0ds=0. In this case,
(39.22) d u = p d ( 1 ρ 0 ) , ( perfect fluid ) . (39.22) d u = p d 1 ρ 0 , (  perfect   fluid  ) . {:(39.22){:[du=-pd((1)/(rho_(0)))",",((" perfect ")/(" fluid ")).]:}:}\begin{array}{cc} \mathrm{d} u=-p \mathrm{~d}\left(\frac{1}{\rho_{0}}\right), & \binom{\text { perfect }}{\text { fluid }} . \tag{39.22} \end{array}(39.22)du=p d(1ρ0),( perfect  fluid ).
From the former expression for the first law d u = p d ( 1 / ρ 0 ) d u = p d 1 / ρ 0 du=-pd(1//rho_(0))\mathrm{d} u=-p \mathrm{~d}\left(1 / \rho_{0}\right)du=p d(1/ρ0), we find an expression for the pressure of a perfect fluid, which is
(39.23) p = ρ 0 2 d u d ρ 0 ( perfect fluid ) . (39.23) p = ρ 0 2 d u d ρ 0 (  perfect   fluid  ) . {:(39.23)p=rho_(0)^(2)((d)u)/((d)rho_(0))quad((" perfect ")/(" fluid ")).:}\begin{equation*} p=\rho_{0}^{2} \frac{\mathrm{~d} u}{\mathrm{~d} \rho_{0}} \quad\binom{\text { perfect }}{\text { fluid }} . \tag{39.23} \end{equation*}(39.23)p=ρ02 du dρ0( perfect  fluid ).

Example 39.5

Evaluated in the static frame, we note that s / t = 0 s / t = 0 del s//del t=0\partial s / \partial t=0s/t=0 for the perfect fluid. However, the entropy is constant when following the flow (in the comoving frame with d / d t d / d t d//dt\mathrm{d} / \mathrm{d} td/dt ) as well as in the static frame (where we use the convective derivative / t + v / t + v del//del t+ vec(v)* vec(grad)\partial / \partial t+\vec{v} \cdot \vec{\nabla}/t+v ), with the consequence that we can write
(39.24) d s d t = ( v ) s = 0 ( perfect fluid ) (39.24) d s d t = ( v ) s = 0 (  perfect   fluid  ) {:(39.24)(ds)/((d)t)=( vec(v)* vec(grad))s=0quad((" perfect ")/(" fluid ")):}\begin{equation*} \frac{\mathrm{d} s}{\mathrm{~d} t}=(\vec{v} \cdot \vec{\nabla}) s=0 \quad\binom{\text { perfect }}{\text { fluid }} \tag{39.24} \end{equation*}(39.24)ds dt=(v)s=0( perfect  fluid )
This latter equation is useful in simplifying thermodynamic descriptions of a perfect fluid.
For the special case of steady flow, we have the defining property t ( t ( (del)/(del t)(\frac{\partial}{\partial t}(t( anything ) = 0 ) = 0 )=0)=0)=0. As a result, we can use a simplifying expression
(39.25) d d t = ( v ) (steady flow). (39.25) d d t = ( v )  (steady flow).  {:(39.25)(d)/((d)t)=( vec(v)* vec(grad))quad" (steady flow). ":}\begin{equation*} \frac{\mathrm{d}}{\mathrm{~d} t}=(\vec{v} \cdot \vec{\nabla}) \quad \text { (steady flow). } \tag{39.25} \end{equation*}(39.25)d dt=(v) (steady flow). 
This simply says that when the flow is steady, the rate of change of quantities with time for the comoving observer is given by the spatial derivative along the flow lines.
For the perfect fluid in a state of steady flow, eqn (39.25) implies two thermodynamic equations for the enthalpy, one for the comoving frame and one for the stationary frame. These are, respectively,
(39.26) d h d t = 1 ρ 0 d p d t , ( perfect fluid, steady flow ) . ρ 0 ( v ) h = ( v ) p . (39.26) d h d t = 1 ρ 0 d p d t , (  perfect fluid,   steady flow  ) . ρ 0 ( v ) h = ( v ) p {:(39.26){:[(dh)/((d)t)=(1)/(rho_(0))((d)p)/((d)t)",",((" perfect fluid, ")/(" steady flow ")).],[rho_(0)( vec(v)* vec(grad))h=( vec(v)* vec(grad))p,". "]:}:}\begin{array}{cc} \frac{\mathrm{d} h}{\mathrm{~d} t}=\frac{1}{\rho_{0}} \frac{\mathrm{~d} p}{\mathrm{~d} t}, & \binom{\text { perfect fluid, }}{\text { steady flow }} . \tag{39.26}\\ \rho_{0}(\vec{v} \cdot \vec{\nabla}) h=(\vec{v} \cdot \vec{\nabla}) p & \text {. } \end{array}(39.26)dh dt=1ρ0 dp dt,( perfect fluid,  steady flow ).ρ0(v)h=(v)p
We can use the latter of these to obtain a more memorable version of Bernoulli's equation (eqn 39.18) for steady flow in a perfect fluid.

Example 39.6

Dot Bernoulli's equation, in the form of eqn 39.18, with the velocity vector to obtain
(39.27) v v t + v ( 1 2 v 2 + Φ ) + v p ρ 0 v ( v × ω ) = 0 (39.27) v v t + v 1 2 v 2 + Φ + v p ρ 0 v ( v × ω ) = 0 {:(39.27) vec(v)*(del( vec(v)))/(del t)+ vec(v)* vec(grad)((1)/(2)v^(2)+Phi)+ vec(v)*(( vec(grad))p)/(rho_(0))- vec(v)*( vec(v)xx vec(omega))=0:}\begin{equation*} \vec{v} \cdot \frac{\partial \vec{v}}{\partial t}+\vec{v} \cdot \vec{\nabla}\left(\frac{1}{2} v^{2}+\Phi\right)+\vec{v} \cdot \frac{\vec{\nabla} p}{\rho_{0}}-\vec{v} \cdot(\vec{v} \times \vec{\omega})=0 \tag{39.27} \end{equation*}(39.27)vvt+v(12v2+Φ)+vpρ0v(v×ω)=0
Spot that v v × ω = 0 v v × ω = 0 vec(v)* vec(v)xx omega=0\vec{v} \cdot \vec{v} \times \omega=0vv×ω=0 and then consider the case for steady flow by setting / t = 0 / t = 0 del//del t=0\partial / \partial t=0/t=0 to obtain
0 = v ( 1 2 v 2 + Φ ) + v h (39.28) = v ( 1 2 v 2 + Φ + h ) 0 = v 1 2 v 2 + Φ + v h (39.28) = v 1 2 v 2 + Φ + h {:[0= vec(v)* vec(grad)((1)/(2)v^(2)+Phi)+ vec(v)* vec(grad)h],[(39.28)= vec(v)* vec(grad)((1)/(2)v^(2)+Phi+h)]:}\begin{align*} 0 & =\vec{v} \cdot \vec{\nabla}\left(\frac{1}{2} v^{2}+\Phi\right)+\vec{v} \cdot \vec{\nabla} h \\ & =\vec{v} \cdot \vec{\nabla}\left(\frac{1}{2} v^{2}+\Phi+h\right) \tag{39.28} \end{align*}0=v(12v2+Φ)+vh(39.28)=v(12v2+Φ+h)
where we have used the second of eqn 39.26 to substitute the pressure with the enthalpy for the case of a perfect fluid. This resulting expression requires a little unpacking at this stage.
From the last example, we have a result given in terms of the operator v v vec(v)* vec(grad)\vec{v} \cdot \vec{\nabla}v appropriate for the stationary observer. Since this is steady flow, we can switch to the comoving frame using the rule that v d d t v d d t vec(v)* vec(grad)rarr((d))/((d)t)\vec{v} \cdot \vec{\nabla} \rightarrow \frac{\mathrm{~d}}{\mathrm{~d} t}v d dt as we move along the flow lines. This gives an alternative form for Bernoulli's equation for steady flow of a perfect fluid in the comoving frame as
(39.29) d d t ( 1 2 v 2 + Φ + h ) = 0 ( perfect fluid, steady flow ) . (39.29) d d t 1 2 v 2 + Φ + h = 0 (  perfect fluid,   steady flow  ) . {:(39.29)(d)/((d)t)((1)/(2)v^(2)+Phi+h)=0quad((" perfect fluid, ")/(" steady flow ")).:}\begin{equation*} \frac{\mathrm{d}}{\mathrm{~d} t}\left(\frac{1}{2} v^{2}+\Phi+h\right)=0 \quad\binom{\text { perfect fluid, }}{\text { steady flow }} . \tag{39.29} \end{equation*}(39.29)d dt(12v2+Φ+h)=0( perfect fluid,  steady flow ).
Notice how the terms in the bracket represent kinetic energy, gravitational energy and enthalpy. We conclude that for steady flow, Bernoulli's equation, which we introduced as a restatement of Euler's equation, simply expresses conservation of energy. It applies for the comoving observer, which is to say that conservation of energy in the form
(39.30) 1 2 v 2 + Φ + h = const. ( perfect fluid, steady flow ) (39.30) 1 2 v 2 + Φ + h =  const.  (  perfect fluid,   steady flow  ) {:(39.30)(1)/(2)v^(2)+Phi+h=" const. "quad((" perfect fluid, ")/(" steady flow ")):}\begin{equation*} \frac{1}{2} v^{2}+\Phi+h=\text { const. } \quad\binom{\text { perfect fluid, }}{\text { steady flow }} \tag{39.30} \end{equation*}(39.30)12v2+Φ+h= const. ( perfect fluid,  steady flow )
applies along the flow lines. Since h = u + p / ρ 0 h = u + p / ρ 0 h=u+p//rho_(0)h=u+p / \rho_{0}h=u+p/ρ0, we can rewrite the conservation of energy equation with pressure included explicitly. That is, along the flow lines we have that Bernoulli's equation reads
(39.31) 1 2 v 2 + Φ + u + p ρ 0 = const. ( perfect fluid steady flow ) (39.31) 1 2 v 2 + Φ + u + p ρ 0 =  const.  (  perfect fluid   steady flow  ) {:(39.31)(1)/(2)v^(2)+Phi+u+(p)/(rho_(0))=" const. "quad((" perfect fluid ")/(" steady flow ")):}\begin{equation*} \frac{1}{2} v^{2}+\Phi+u+\frac{p}{\rho_{0}}=\text { const. } \quad\binom{\text { perfect fluid }}{\text { steady flow }} \tag{39.31} \end{equation*}(39.31)12v2+Φ+u+pρ0= const. ( perfect fluid  steady flow )
To complete our look at the energetics of the fluid we consider an irrotational flow, where × v = 0 × v = 0 vec(grad)xx vec(v)=0\vec{\nabla} \times \vec{v}=0×v=0. In this case we can introduce a velocity potential ϕ ϕ phi\phiϕ via
(39.32) v = ϕ , (39.32) v = ϕ , {:(39.32) vec(v)= vec(grad)phi",":}\begin{equation*} \vec{v}=\vec{\nabla} \phi, \tag{39.32} \end{equation*}(39.32)v=ϕ,
which has the property × ϕ = 0 × ϕ = 0 vec(grad)xx vec(grad)phi=0\vec{\nabla} \times \vec{\nabla} \phi=0×ϕ=0. Let's see how this works in Bernoulli's equation.
Example 39.7
From the first law for enthalpy we have d h = T d s + d p / ρ 0 d h = T d s + d p / ρ 0 dh=Tds+dp//rho_(0)\mathrm{d} h=T \mathrm{~d} s+\mathrm{d} p / \rho_{0}dh=T ds+dp/ρ0. Taking derivatives with respect to the spatial coordinates x i x i x^(i)x^{i}xi we have
(39.33) h = T s + 1 ρ 0 p (39.33) h = T s + 1 ρ 0 p {:(39.33) vec(grad)h=T vec(grad)s+(1)/(rho_(0))* vec(grad)p:}\begin{equation*} \vec{\nabla} h=T \vec{\nabla} s+\frac{1}{\rho_{0}} \cdot \vec{\nabla} p \tag{39.33} \end{equation*}(39.33)h=Ts+1ρ0p
If the flow is isentropic the middle term falls out and we have h = p / ρ 0 h = p / ρ 0 vec(grad)h= vec(grad)p//rho_(0)\vec{\nabla} h=\vec{\nabla} p / \rho_{0}h=p/ρ0. As a result, we now write an expression for unsteady, isentropic, irrotational flow of a perfect fluid. Once again, start with Bernoulli's equation
(39.34) 0 = v t + ( 1 2 v 2 + Φ ) + p ρ 0 ( v × ω ) , (39.34) 0 = v t + 1 2 v 2 + Φ + p ρ 0 ( v × ω ) , {:(39.34)0=(del( vec(v)))/(del t)+ vec(grad)((1)/(2)v^(2)+Phi)+(( vec(grad))p)/(rho_(0))-( vec(v)xx vec(omega))",":}\begin{equation*} 0=\frac{\partial \vec{v}}{\partial t}+\vec{\nabla}\left(\frac{1}{2} v^{2}+\Phi\right)+\frac{\vec{\nabla} p}{\rho_{0}}-(\vec{v} \times \vec{\omega}), \tag{39.34} \end{equation*}(39.34)0=vt+(12v2+Φ)+pρ0(v×ω),
and then use v = ϕ , ω = 0 v = ϕ , ω = 0 vec(v)= vec(grad)phi, vec(omega)=0\vec{v}=\vec{\nabla} \phi, \vec{\omega}=0v=ϕ,ω=0 and h = p / ρ 0 h = p / ρ 0 vec(grad)h= vec(grad)p//rho_(0)\vec{\nabla} h=\vec{\nabla} p / \rho_{0}h=p/ρ0 to say
(39.35) 0 = ϕ t + ( 1 2 v 2 + Φ ) + h (39.36) = ( 1 2 v 2 + Φ + h + ϕ t ) (39.35) 0 = ϕ t + 1 2 v 2 + Φ + h (39.36) = 1 2 v 2 + Φ + h + ϕ t {:[(39.35)0= vec(grad)(del phi)/(del t)+ vec(grad)((1)/(2)v^(2)+Phi)+ vec(grad)h],[(39.36)= vec(grad)((1)/(2)v^(2)+Phi+h+(del phi)/(del t))]:}\begin{align*} 0 & =\vec{\nabla} \frac{\partial \phi}{\partial t}+\vec{\nabla}\left(\frac{1}{2} v^{2}+\Phi\right)+\vec{\nabla} h \tag{39.35}\\ & =\vec{\nabla}\left(\frac{1}{2} v^{2}+\Phi+h+\frac{\partial \phi}{\partial t}\right) \tag{39.36} \end{align*}(39.35)0=ϕt+(12v2+Φ)+h(39.36)=(12v2+Φ+h+ϕt)
To guarantee that the right-hand side of this last equation vanishes, we part in the bracket must be constant in space.
The result of the last example is Bernoulli's equation for unsteady flow of an irrotational, perfect fluid. Notice how the operator on the bracket was vec(grad)\vec{\nabla} and not v v vec(v)* vec(grad)\vec{v} \cdot \vec{\nabla}v. This implies that the equation does not just apply along the flow lines (identified with v v vec(v)*grad\vec{v} \cdot \nablav ) but everywhere. That is to say, throughout the fluid, we have
(39.37) ( 1 2 v 2 + Φ + h + ϕ t ) = const. ( irrotational perfect fluid unsteady flow ) (39.37) 1 2 v 2 + Φ + h + ϕ t =  const.  (  irrotational perfect fluid   unsteady flow  ) {:(39.37)((1)/(2)v^(2)+Phi+h+(del phi)/(del t))=" const. "quad((" irrotational perfect fluid ")/(" unsteady flow ")):}\begin{equation*} \left(\frac{1}{2} v^{2}+\Phi+h+\frac{\partial \phi}{\partial t}\right)=\text { const. } \quad\binom{\text { irrotational perfect fluid }}{\text { unsteady flow }} \tag{39.37} \end{equation*}(39.37)(12v2+Φ+h+ϕt)= const. ( irrotational perfect fluid  unsteady flow )
This differs from the steady flow version through the time derivative of the velocity potential and in its realm of applicability.

39.3 Energy-momentum tensor

We have already seen the importance of the energy-momentum tensor. Here we consider its form in the non-relativistic fluid. The main message here is that we can solve many matter-field problems using the key equation
(39.38) T = 0 (39.38) T = 0 {:(39.38)grad*T=0:}\begin{equation*} \nabla \cdot \boldsymbol{T}=0 \tag{39.38} \end{equation*}(39.38)T=0
which expresses conservation of energy-momentum. 10 10 ^(10){ }^{10}10
Recall that the energy-momentum tensor for the relativistic fluid was given by
(39.39) T = ( ρ + p ) v v + p g (39.39) T = ( ρ + p ) v v + p g {:(39.39)T=(rho+p)v ox v+pg:}\begin{equation*} \boldsymbol{T}=(\rho+p) \boldsymbol{v} \otimes \boldsymbol{v}+p \boldsymbol{g} \tag{39.39} \end{equation*}(39.39)T=(ρ+p)vv+pg
where 4 -vectors are used for velocity v v v\boldsymbol{v}v and g g g\boldsymbol{g}g is the metric. Here ρ ρ rho\rhoρ is the energy density of the fluid (as measured in its local rest frame). In general, it will receive contributions from the mass density ρ 0 ρ 0 rho_(0)\rho_{0}ρ0 and also the internal energy density ρ 0 u ρ 0 u rho_(0)u\rho_{0} uρ0u, where u u uuu is the specific internal energy of the fluid. 11 11 ^(11){ }^{11}11 In the case of flat spacetime, we have g = η g = η g=eta\boldsymbol{g}=\boldsymbol{\eta}g=η and so, if fluid has a non-relativistic velocity v v v\boldsymbol{v}v, we have ρ p / c 2 ρ p / c 2 rho≫p//c^(2)\rho \gg p / c^{2}ρp/c2 and
(39.40) T = ρ v v + p η (39.40) T = ρ v v + p η {:(39.40)T=rho v ox v+p eta:}\begin{equation*} \boldsymbol{T}=\rho \boldsymbol{v} \otimes \boldsymbol{v}+p \boldsymbol{\eta} \tag{39.40} \end{equation*}(39.40)T=ρvv+pη
or, in components,
(39.41) T μ ν = ρ v μ v ν + p η μ ν (39.41) T μ ν = ρ v μ v ν + p η μ ν {:(39.41)T^(mu nu)=rhov^(mu)v^(nu)+peta^(mu nu):}\begin{equation*} T^{\mu \nu}=\rho v^{\mu} v^{\nu}+p \eta^{\mu \nu} \tag{39.41} \end{equation*}(39.41)Tμν=ρvμvν+pημν

Example 39.8

The last expression for T T T\boldsymbol{T}T can be plugged into the key equation T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 to yield
(39.42) T μ ν x μ = ρ x μ v μ v ν + ρ v μ x μ v ν + ρ v μ v ν x μ + p x μ η μ ν = 0 (39.42) T μ ν x μ = ρ x μ v μ v ν + ρ v μ x μ v ν + ρ v μ v ν x μ + p x μ η μ ν = 0 {:(39.42)(delT^(mu nu))/(delx^(mu))=(del rho)/(delx^(mu))v^(mu)v^(nu)+rho(delv^(mu))/(delx^(mu))v^(nu)+rhov^(mu)(delv^(nu))/(delx^(mu))+(del p)/(delx^(mu))eta^(mu nu)=0:}\begin{equation*} \frac{\partial T^{\mu \nu}}{\partial x^{\mu}}=\frac{\partial \rho}{\partial x^{\mu}} v^{\mu} v^{\nu}+\rho \frac{\partial v^{\mu}}{\partial x^{\mu}} v^{\nu}+\rho v^{\mu} \frac{\partial v^{\nu}}{\partial x^{\mu}}+\frac{\partial p}{\partial x^{\mu}} \eta^{\mu \nu}=0 \tag{39.42} \end{equation*}(39.42)Tμνxμ=ρxμvμvν+ρvμxμvν+ρvμvνxμ+pxμημν=0
Next, we use the non-relativistic expression for the components of velocity: v μ = v μ = v^(mu)=v^{\mu}=vμ= ( 1 , v ) ( 1 , v ) (1, vec(v))(1, \vec{v})(1,v), to write
T μ ν x μ = ρ t v ν + v ρ v ν + ρ v v ν + ρ v ν t + ρ v v ν p t δ ν t + ( p ) i δ ν i = 0 . T μ ν x μ = ρ t v ν + v ρ v ν + ρ v v ν + ρ v ν t + ρ v v ν p t δ ν t + ( p ) i δ ν i = 0 . (delT^(mu nu))/(delx^(mu))=(del rho)/(del t)v^(nu)+ vec(v)* vec(grad)rhov^(nu)+rho vec(grad)* vec(v)v^(nu)+rho(delv^(nu))/(del t)+rho vec(v)* vec(grad)v^(nu)-(del p)/(del t)delta^(nu)_(t)+( vec(grad)p)^(i)delta^(nu)_(i)=0.quad\frac{\partial T^{\mu \nu}}{\partial x^{\mu}}=\frac{\partial \rho}{\partial t} v^{\nu}+\vec{v} \cdot \vec{\nabla} \rho v^{\nu}+\rho \vec{\nabla} \cdot \vec{v} v^{\nu}+\rho \frac{\partial v^{\nu}}{\partial t}+\rho \vec{v} \cdot \vec{\nabla} v^{\nu}-\frac{\partial p}{\partial t} \delta^{\nu}{ }_{t}+(\vec{\nabla} p)^{i} \delta^{\nu}{ }_{i}=0 . \quadTμνxμ=ρtvν+vρvν+ρvvν+ρvνt+ρvvνptδνt+(p)iδνi=0. (39.43)
When ν = t ν = t nu=t\nu=tν=t we have 12 12 ^(12){ }^{12}12
p t = ρ t + v ρ + ρ v (39.44) = ρ t + ( ρ v ) p t = ρ t + v ρ + ρ v (39.44) = ρ t + ( ρ v ) {:[(del p)/(del t)=(del rho)/(del t)+ vec(v)* vec(grad)rho+rho vec(grad)* vec(v)],[(39.44)=(del rho)/(del t)+ vec(grad)*(rho vec(v))]:}\begin{align*} \frac{\partial p}{\partial t} & =\frac{\partial \rho}{\partial t}+\vec{v} \cdot \vec{\nabla} \rho+\rho \vec{\nabla} \cdot \vec{v} \\ & =\frac{\partial \rho}{\partial t}+\vec{\nabla} \cdot(\rho \vec{v}) \tag{39.44} \end{align*}pt=ρt+vρ+ρv(39.44)=ρt+(ρv)
This is a statement of conservation of energy. From the first law we see the left-hand side gives p / t = ρ 0 h / t p / t = ρ 0 h / t del p//del t=rho_(0)del h//del t\partial p / \partial t=\rho_{0} \partial h / \partial tp/t=ρ0h/t, telling us about sources and sinks of energy. The righthand side expresses the local flow of energy density J J grad*J\boldsymbol{\nabla} \cdot \boldsymbol{J}J. In the absence of energy sources, we have J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0.
Considering next the spatial components of eqn 39.43 (by setting ν = i ν = i nu=i\nu=iν=i ), we find
(39.45) ρ t v i + ( v ρ ) v i + ( ρ v ) v i + ρ v i t + ρ v v i + ( p ) i = 0 (39.45) ρ t v i + ( v ρ ) v i + ( ρ v ) v i + ρ v i t + ρ v v i + ( p ) i = 0 {:(39.45)(del rho)/(del t)v^(i)+( vec(v)* vec(grad)rho)v^(i)+(rho vec(grad)* vec(v))v^(i)+rho(delv^(i))/(del t)+rho vec(v)* vec(grad)v^(i)+( vec(grad)p)^(i)=0:}\begin{equation*} \frac{\partial \rho}{\partial t} v^{i}+(\vec{v} \cdot \vec{\nabla} \rho) v^{i}+(\rho \vec{\nabla} \cdot \vec{v}) v^{i}+\rho \frac{\partial v^{i}}{\partial t}+\rho \vec{v} \cdot \vec{\nabla} v^{i}+(\vec{\nabla} p)^{i}=0 \tag{39.45} \end{equation*}(39.45)ρtvi+(vρ)vi+(ρv)vi+ρvit+ρvvi+(p)i=0
If we assume that the internal energy density contribution to ρ ρ rho\rhoρ is small in the nonrelativistic limit, then the first three terms express local conservation of mass-density. Setting them to zero (i.e. assuming conservation of mass), the final three terms yield
(39.46) ρ v t + ρ v v = p (39.46) ρ v t + ρ v v = p {:(39.46)rho(del( vec(v)))/(del t)+rho vec(v)* vec(grad) vec(v)=- vec(grad)p:}\begin{equation*} \rho \frac{\partial \vec{v}}{\partial t}+\rho \vec{v} \cdot \vec{\nabla} \vec{v}=-\vec{\nabla} p \tag{39.46} \end{equation*}(39.46)ρvt+ρvv=p
which is Euler's equation.
We see how the conservation of energy-momentum approach yields up both (i) the conservation of mass-energy and (ii) the equations of motion. We shall find the same idea applies for relativistic fluids, allowing us to extract a wealth of information from T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0.
11 11 ^(11){ }^{11}11 Restoring factors of c c ccc, we have an energy density ρ c 2 = ρ 0 c 2 + ρ 0 u ρ c 2 = ρ 0 c 2 + ρ 0 u rhoc^(2)=rho_(0)c^(2)+rho_(0)u\rho c^{2}=\rho_{0} c^{2}+\rho_{0} uρc2=ρ0c2+ρ0u. In natural units, we write ρ = ρ 0 ( 1 + u ) ρ = ρ 0 ( 1 + u ) rho=rho_(0)(1+u)\rho=\rho_{0}(1+u)ρ=ρ0(1+u).
12 12 ^(12){ }^{12}12 The untidy justification presented here can be made more respectable by projecting out parts of the divergence using the velocity vector. That method is presented in the next section.

39.4 Relativistic fluids

We shall apply the same methodology that we applied to the nonrelativistic fluid to its relativistic counterpart. In particular, we require relativistic upgrades of the appropriate derivatives, expressions for energy and for the energy-momentum tensor.
Mathematically, relativistic fluids are described by an energy density and a set of flow lines. The flow lines are the integral curves of the velocity field 13 u 13 u ^(13)u{ }^{13} \boldsymbol{u}13u. The (3+1)-dimensional derivative along the flow lines is given by the covariant derivative u = u u = u u*grad=grad_(u)\boldsymbol{u} \cdot \boldsymbol{\nabla}=\boldsymbol{\nabla}_{\boldsymbol{u}}u=u. In general, the flow lines are not geodesics described by u u = 0 u u = 0 grad_(u)u=0\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}=0uu=0. We shall see that the equation of motion (a relativistic Euler equation) tells us directly how much the flow lines depart from being geodesics. The departure is caused by gradients in pressure.
The relativistic fluid in which we're interested is unlikely to be very fast-moving water, or similar. Rather, we are interested in the cosmological fluid filling spacetime, or the energetic matter inside stars. This is made up, principally of massive particles we'll call 14 14 ^(14){ }^{14}14 baryons. We also need to be slightly more careful about our definitions of thermodynamic variables. Let's define a set of objects n , ρ , p , T s n , ρ , p , T s n,rho,p,Tsn, \rho, p, T sn,ρ,p,Ts. These are scalar fields and so vary as a function of position in spacetime. In the rest frame of an element of fluid, they correspond to the results of measurements of the following physical quantities: 15 15 ^(15){ }^{15}15
  • The number density of baryons is n n nnn. If the average rest mass per baryon is m ¯ b m ¯ b bar(m)_(b)\bar{m}_{\mathrm{b}}m¯b, then the rest-mass density of baryons is ρ 0 = m ¯ b n ρ 0 = m ¯ b n rho_(0)= bar(m)_(b)n\rho_{0}=\bar{m}_{\mathrm{b}} nρ0=m¯bn.
  • The total energy density is ρ ρ rho\rhoρ. This includes the mass-energy of baryons ρ 0 ρ 0 rho_(0)\rho_{0}ρ0 and internal energy. If the specific internal energy is u u uuu then we have a total energy density 16 16 ^(16){ }^{16}16
(39.48) ρ = ρ 0 ( 1 + u ) . (39.48) ρ = ρ 0 ( 1 + u ) . {:(39.48)rho=rho_(0)(1+u).:}\begin{equation*} \rho=\rho_{0}(1+u) . \tag{39.48} \end{equation*}(39.48)ρ=ρ0(1+u).
  • The isotropic pressure is p p ppp.
  • The temperature is T T TTT.
  • The entropy per baryon is s s sss.
In non-relativistic fluid dynamics, we had two useful rules, (i) the derivative rule: that we use d / d t d / d t d//dt\mathrm{d} / \mathrm{d} td/dt in the comoving frame and / t + v / t + v del//del t+ vec(v)* vec(grad)\partial / \partial t+\vec{v} \cdot \vec{\nabla}/t+v otherwise, and (ii) the divergence rule v = 1 / V ( d V / d t ) v = 1 / V ( d V / d t ) vec(grad)* vec(v)=1//V(dV//dt)\vec{\nabla} \cdot \vec{v}=1 / V(\mathrm{~d} V / \mathrm{d} t)v=1/V( dV/dt). These have memorable relativistic counterparts: (i) a comoving observer measures rates of change using their proper time 17 D / d τ 17 D / d τ ^(17)D//dtau{ }^{17} \mathrm{D} / \mathrm{d} \tau17D/dτ, while the natural upgrade for the convective derivative is the covariant derivative u u grad_(u)\boldsymbol{\nabla}_{\boldsymbol{u}}u; (ii) the derivative rule becomes u = ( 1 / V ) d V / d τ u = ( 1 / V ) d V / d τ grad*u=(1//V)dV//dtau\boldsymbol{\nabla} \cdot \boldsymbol{u}=(1 / V) \mathrm{d} V / \mathrm{d} \tauu=(1/V)dV/dτ.

Example 39.9

Let's examine the rules in more detail. For rule (i), we consider a particle that moves along a world line parametrized by proper time τ τ tau\tauτ which has a tangent vector u u u\boldsymbol{u}u. In the comoving frame in flat space, we recall that the rate of change of a vector quantity like A A A\boldsymbol{A}A is
(39.50) D A d τ = u A = u α ( A β x α ) e β (39.50) D A d τ = u A = u α A β x α e β {:(39.50)(DA)/((d)tau)=grad_(u)A=u^(alpha)((delA^(beta))/(delx^(alpha)))e_(beta):}\begin{equation*} \frac{\mathrm{D} \boldsymbol{A}}{\mathrm{~d} \tau}=\nabla_{u} \boldsymbol{A}=u^{\alpha}\left(\frac{\partial A^{\beta}}{\partial x^{\alpha}}\right) \boldsymbol{e}_{\beta} \tag{39.50} \end{equation*}(39.50)DA dτ=uA=uα(Aβxα)eβ
where we've used the fact that Γ μ α β = 0 Γ μ α β = 0 Gamma^(mu)_(alpha beta)=0\Gamma^{\mu}{ }_{\alpha \beta}=0Γμαβ=0 for flat space. Expanding components we have
(39.51) D A d τ = u A = ( u t t + u i i ) A (39.51) D A d τ = u A = u t t + u i i A {:(39.51)(DA)/((d)tau)=grad_(u)A=(u^(t)(del)/(del t)+u^(i) vec(grad)_(i))A:}\begin{equation*} \frac{\mathrm{D} \boldsymbol{A}}{\mathrm{~d} \tau}=\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{A}=\left(u^{t} \frac{\partial}{\partial t}+u^{i} \vec{\nabla}_{i}\right) \boldsymbol{A} \tag{39.51} \end{equation*}(39.51)DA dτ=uA=(utt+uii)A
which, recalling that u u u\boldsymbol{u}u has components ( γ ( u ) , γ ( u ) u ) ( γ ( u ) , γ ( u ) u ) (gamma(u),gamma(u) vec(u))(\gamma(u), \gamma(u) \vec{u})(γ(u),γ(u)u), illustrates how the covariant derivative, with its directional property in spacetime, supplies the natural relativistic analogue of our previous expression for the convective derivative.
For rule (ii), we seek an upgrade of the rule to convert between the divergence of velocity and fluid volume whose form, we recall, in non-relativistic Euclidean space was 1 V d d t = v 1 V d d t = v (1)/(V)((d))/((d)t)=grad* vec(v)\frac{1}{V} \frac{\mathrm{~d}}{\mathrm{~d} t}=\nabla \cdot \vec{v}1V d dt=v. To justify our proposed upgrade, we note that non-relativistically u t = dt t d τ = 1 u t = dt t d τ = 1 u^(t)=(dtt)/((d)tau)=1u^{t}=\frac{\mathrm{dt} t}{\mathrm{~d} \tau}=1ut=dtt dτ=1 and so, motivated by rule (i), we can suggest the compatible expression 18 18 ^(18){ }^{18}18
(39.52) 1 V d V d τ = u (39.52) 1 V d V d τ = u {:(39.52)(1)/(V)((d)V)/((d)tau)=grad*u:}\begin{equation*} \frac{1}{V} \frac{\mathrm{~d} V}{\mathrm{~d} \tau}=\boldsymbol{\nabla} \cdot \boldsymbol{u} \tag{39.52} \end{equation*}(39.52)1V dV dτ=u
where u u u\boldsymbol{u}u is the 4 -velocity of the fluid. Since our expressions are valid covariant statements in flat spacetime, they must, by the principle of covariance, be the case in curved spacetime. With this in hand, we can check the conservation equation.
As we've seen previously, the relativistic energy-momentum tensor T T T\boldsymbol{T}T for a perfect fluid is given by
(39.53) T = ( ρ + p ) u u + p g , T μ ν = ( ρ + p ) u μ u ν + p g μ ν (39.53) T = ( ρ + p ) u u + p g , T μ ν = ( ρ + p ) u μ u ν + p g μ ν {:(39.53)T=(rho+p)u ox u+pg","quadT^(mu nu)=(rho+p)u^(mu)u^(nu)+pg^(mu nu):}\begin{equation*} \boldsymbol{T}=(\rho+p) \boldsymbol{u} \otimes \boldsymbol{u}+p \boldsymbol{g}, \quad T^{\mu \nu}=(\rho+p) u^{\mu} u^{\nu}+p g^{\mu \nu} \tag{39.53} \end{equation*}(39.53)T=(ρ+p)uu+pg,Tμν=(ρ+p)uμuν+pgμν
In the fluid's rest frame, we have T 00 = ρ , T j 0 = 0 T 00 = ρ , T j 0 = 0 T^(00)=rho,T^(j0)=0T^{00}=\rho, T^{j 0}=0T00=ρ,Tj0=0 and T i j = p δ i k T i j = p δ i k T^(ij)=pdelta^(ik)T^{i j}=p \delta^{i k}Tij=pδik. The conservation equation takes the form
(39.54) T = 0 , T ; ν μ ν = 0 (39.54) T = 0 , T ; ν μ ν = 0 {:(39.54)grad*T=0","quadT_(;nu)^(mu nu)=0:}\begin{equation*} \boldsymbol{\nabla} \cdot \boldsymbol{T}=0, \quad T_{; \nu}^{\mu \nu}=0 \tag{39.54} \end{equation*}(39.54)T=0,T;νμν=0
where we've generalized to the version appropriate to curved spacetime using the semicolon notation.
Example 39.10
As in Chapter 12, take the divergence of T T T\boldsymbol{T}T and use the Leibniz rule to find the following:
(39.55) ( ρ + p ) , β u α u β + ( ρ + p ) ( u ; β α u β + u α u ; β β ) + p , β g α β = 0 . (39.55) ( ρ + p ) , β u α u β + ( ρ + p ) u ; β α u β + u α u ; β β + p , β g α β = 0 . {:(39.55)(rho+p)_(,beta)u^(alpha)u^(beta)+(rho+p)(u_(;beta)^(alpha)u^(beta)+u^(alpha)u_(;beta)^(beta))+p","betag^(alpha beta)=0.:}\begin{equation*} (\rho+p)_{, \beta} u^{\alpha} u^{\beta}+(\rho+p)\left(u_{; \beta}^{\alpha} u^{\beta}+u^{\alpha} u_{; \beta}^{\beta}\right)+p, \beta g^{\alpha \beta}=0 . \tag{39.55} \end{equation*}(39.55)(ρ+p),βuαuβ+(ρ+p)(u;βαuβ+uαu;ββ)+p,βgαβ=0.
This expression is very useful in what follows.
With the above concepts in place, we can discuss the key ideas that govern relativistic fluid dynamics.
I: Baryon conservation is perhaps the most primitive conservation law we can specify. It is written in the comoving frame as 19 19 ^(19){ }^{19}19
(39.57) d ( n V ) d τ = 0 (39.57) d ( n V ) d τ = 0 {:(39.57)(d(nV))/(dtau)=0:}\begin{equation*} \frac{\mathrm{d}(n V)}{\mathrm{d} \tau}=0 \tag{39.57} \end{equation*}(39.57)d(nV)dτ=0
In general curved spacetime, this is written in terms of the covariant derivative as
(39.58) ( n u ) = 0 , ( n u α ) ; α = 0 (39.58) ( n u ) = 0 , n u α ; α = 0 {:(39.58)grad*(nu)=0","quad(nu^(alpha))_(;alpha)=0:}\begin{equation*} \boldsymbol{\nabla} \cdot(n \boldsymbol{u})=0, \quad\left(n u^{\alpha}\right)_{; \alpha}=0 \tag{39.58} \end{equation*}(39.58)(nu)=0,(nuα);α=0
where the semicolon notation has been employed in the component version on the right.
18 18 ^(18){ }^{18}18 Recall that v = v i , i v = v i , i vec(grad)* vec(v)=v^(i)_(,i)\vec{\nabla} \cdot \vec{v}=v^{i}{ }_{, i}v=vi,i and u = u = grad*u=\boldsymbol{\nabla} \cdot \boldsymbol{u}=u= u μ ; μ u μ ; μ u^(mu)_(;mu)u^{\mu}{ }_{; \mu}uμ;μ.
20 20 ^(20){ }^{20}20 Also recall that u u u u grad_(u)-=u*grad\nabla_{u} \equiv \boldsymbol{u} \cdot \boldsymbol{\nabla}uu.
21 21 ^(21){ }^{21}21 It's also useful to note that we also have
(39.61) u = 1 ρ 0 d ρ 0 d t . (39.61) u = 1 ρ 0 d ρ 0 d t . {:(39.61)grad*u=-(1)/(rho_(0))((d)rho_(0))/((d)t).:}\begin{equation*} \boldsymbol{\nabla} \cdot \boldsymbol{u}=-\frac{1}{\rho_{0}} \frac{\mathrm{~d} \rho_{0}}{\mathrm{~d} t} . \tag{39.61} \end{equation*}(39.61)u=1ρ0 dρ0 dt.
Example 39.11
We prove this using our rules. The comoving rest mass conservation equation becomes
(39.59) 0 = d n d τ + n V d V d τ (39.59) 0 = d n d τ + n V d V d τ {:(39.59)0=(dn)/((d)tau)+(n)/(V)((d)V)/((d)tau):}\begin{equation*} 0=\frac{\mathrm{d} n}{\mathrm{~d} \tau}+\frac{n}{V} \frac{\mathrm{~d} V}{\mathrm{~d} \tau} \tag{39.59} \end{equation*}(39.59)0=dn dτ+nV dV dτ
Consider the terms on the right, using rule (i) on the first term and rule (ii) on the second to get 20 20 ^(20){ }^{20}20
0 = u n + n ( u ) (39.60) = u n + n ( u ) = ( n u ) 0 = u n + n ( u ) (39.60) = u n + n ( u ) = ( n u ) {:[0=grad_(u)n+n(grad*u)],[(39.60)=u*grad n+n(grad*u)=grad*(nu)]:}\begin{align*} 0 & =\boldsymbol{\nabla}_{\boldsymbol{u}} n+n(\boldsymbol{\nabla} \cdot \boldsymbol{u}) \\ & =\boldsymbol{u} \cdot \boldsymbol{\nabla} n+n(\boldsymbol{\nabla} \cdot \boldsymbol{u})=\boldsymbol{\nabla} \cdot(n \boldsymbol{u}) \tag{39.60} \end{align*}0=un+n(u)(39.60)=un+n(u)=(nu)
The vanishing of the final expression, whose coordinate version is ( n u α ) ; α n u α ; α (nu^(alpha))_(;alpha)\left(n u^{\alpha}\right)_{; \alpha}(nuα);α is what we wanted to prove. 21 21 ^(21){ }^{21}21
II: Entropy conservation The amount of entropy in a volume V V VVV is S = n s V S = n s V S=nsVS=n s VS=nsV. In a perfect fluid, we assume that there is no heat flow between flow lines, which implies
(39.62) d ( n s V ) d τ 0 (39.62) d ( n s V ) d τ 0 {:(39.62)(d(nsV))/(dtau) >= 0:}\begin{equation*} \frac{\mathrm{d}(n s V)}{\mathrm{d} \tau} \geq 0 \tag{39.62} \end{equation*}(39.62)d(nsV)dτ0
This can be combined with the conservation of baryons d ( n V ) / d τ d ( n V ) / d τ d(nV)//dtau\mathrm{d}(n V) / \mathrm{d} \taud(nV)/dτ to allow us to conclude that d s d τ 0 d s d τ 0 (ds)/((d)tau) >= 0\frac{\mathrm{d} s}{\mathrm{~d} \tau} \geq 0ds dτ0, which is the second law of thermodynamics. In many cases, we consider a perfect fluid, where we have d s d τ = 0 d s d τ = 0 (ds)/((d)tau)=0\frac{\mathrm{d} s}{\mathrm{~d} \tau}=0ds dτ=0.
III: Energy conservation is expressed via T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0. As in the nonrelativistic example, this powerful equation actually outputs too much in the case that we substitute the expression for the energy-density of the relativistic fluid. That is to say it outputs the energy conservation and also the equation of motion. To restrict the result to energy conservation we use a contracted version of this equation
(39.63) u α T ; β α β = 0 (39.63) u α T ; β α β = 0 {:(39.63)u_(alpha)T_(;beta)^(alpha beta)=0:}\begin{equation*} u_{\alpha} T_{; \beta}^{\alpha \beta}=0 \tag{39.63} \end{equation*}(39.63)uαT;βαβ=0
This is clearly true and the projection along the velocity direction is enough to extract the energy conservation.

Example 39.12

We project eqn 39.55, describing T α β T α β T^(alpha beta)T^{\alpha \beta}Tαβ for the perfect fluid, along u α u α u_(alpha)u_{\alpha}uα and 22 22 ^(22){ }^{22}22 find
0 = ( ρ + p ) , β | u | 2 u β + ( ρ + p ) ( u α ; β u β u α + | u | 2 u β ; β ) + p , β u β 0 = ( ρ + p ) , β u β + ( ρ + p ) ( u α ; β u β u α u β ; β ) + p , β u β (39.64) = ρ , β u β + ( ρ + p ) ( u ; β α u β u α u β ; β ) . 0 = ( ρ + p ) , β | u | 2 u β + ( ρ + p ) u α ; β u β u α + | u | 2 u β ; β + p , β u β 0 = ( ρ + p ) , β u β + ( ρ + p ) u α ; β u β u α u β ; β + p , β u β (39.64) = ρ , β u β + ( ρ + p ) u ; β α u β u α u β ; β . {:[0=(rho+p)_(,beta)|u|^(2)u^(beta)+(rho+p)(u^(alpha)_(;beta)u^(beta)u_(alpha)+|u|^(2)u^(beta)_(;beta))+p","betau^(beta)0],[=-(rho+p)_(,beta)u^(beta)+(rho+p)(u^(alpha)_(;beta)u^(beta)u_(alpha)-u^(beta)_(;beta))+p_(,beta)u^(beta)],[(39.64)=-rho_(,beta)u^(beta)+(rho+p)(u_(;beta)^(alpha)u^(beta)u_(alpha)-u^(beta)_(;beta)).]:}\begin{align*} 0 & =(\rho+p){ }_{, \beta}|u|^{2} u^{\beta}+(\rho+p)\left(u^{\alpha}{ }_{; \beta} u^{\beta} u_{\alpha}+|u|^{2} u^{\beta}{ }_{; \beta}\right)+p, \beta u^{\beta} 0 \\ & =-(\rho+p)_{, \beta} u^{\beta}+(\rho+p)\left(u^{\alpha}{ }_{; \beta} u^{\beta} u_{\alpha}-u^{\beta}{ }_{; \beta}\right)+p_{, \beta} u^{\beta} \\ & =-\rho_{, \beta} u^{\beta}+(\rho+p)\left(u_{; \beta}^{\alpha} u^{\beta} u_{\alpha}-u^{\beta}{ }_{; \beta}\right) . \tag{39.64} \end{align*}0=(ρ+p),β|u|2uβ+(ρ+p)(uα;βuβuα+|u|2uβ;β)+p,βuβ0=(ρ+p),βuβ+(ρ+p)(uα;βuβuαuβ;β)+p,βuβ(39.64)=ρ,βuβ+(ρ+p)(u;βαuβuαuβ;β).
To break this down further, concentrate on the term u α ; β u β u α u α ; β u β u α u^(alpha)_(;beta)u^(beta)u_(alpha)u^{\alpha}{ }_{; \beta} u^{\beta} u_{\alpha}uα;βuβuα. Translating back into vector notation, this is rewritten as
(39.65) u α ; β u β u α u u u (39.65) u α ; β u β u α u u u {:(39.65)u^(alpha)_(;beta)u^(beta)u_(alpha)-=u*grad_(u)u:}\begin{equation*} u^{\alpha}{ }_{; \beta} u^{\beta} u_{\alpha} \equiv \boldsymbol{u} \cdot \nabla_{u} \boldsymbol{u} \tag{39.65} \end{equation*}(39.65)uα;βuβuαuuu
This is useful since we can spot that in the same way as for vectors in flat space, where we have v d v d x = 1 2 d v 2 d x v d v d x = 1 2 d v 2 d x v*(dv)/((d)x)=(1)/(2)((d)v^(2))/((d)x)\boldsymbol{v} \cdot \frac{\mathrm{d} v}{\mathrm{~d} x}=\frac{1}{2} \frac{\mathrm{~d} \boldsymbol{v}^{2}}{\mathrm{~d} x}vdv dx=12 dv2 dx, we have here that
(39.66) u u u = 1 2 u ( u 2 ) = 0 (39.66) u u u = 1 2 u u 2 = 0 {:(39.66)u*grad_(u)u=(1)/(2)grad_(u)(u^(2))=0:}\begin{equation*} \boldsymbol{u} \cdot \nabla_{u} \boldsymbol{u}=\frac{1}{2} \boldsymbol{\nabla}_{u}\left(\boldsymbol{u}^{2}\right)=0 \tag{39.66} \end{equation*}(39.66)uuu=12u(u2)=0
which again relies on u 2 = 1 u 2 = 1 u^(2)=-1\boldsymbol{u}^{2}=-1u2=1. This allows us to say
(39.67) ρ , β u β ( ρ + p ) u ; β β = 0 (39.67) ρ , β u β ( ρ + p ) u ; β β = 0 {:(39.67)-rho_(,beta)u^(beta)-(rho+p)u_(;beta)^(beta)=0:}\begin{equation*} -\rho_{, \beta} u^{\beta}-(\rho+p) u_{; \beta}^{\beta}=0 \tag{39.67} \end{equation*}(39.67)ρ,βuβ(ρ+p)u;ββ=0
Written in coordinate-free notation, the conclusion from the last example, for the projection of the equation T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 along the velocity direction, is
(39.68) u ρ + ( ρ + p ) u = 0 (39.68) u ρ + ( ρ + p ) u = 0 {:(39.68)grad_(u)rho+(rho+p)grad*u=0:}\begin{equation*} \boldsymbol{\nabla}_{\boldsymbol{u}} \rho+(\rho+p) \boldsymbol{\nabla} \cdot \boldsymbol{u}=0 \tag{39.68} \end{equation*}(39.68)uρ+(ρ+p)u=0
This does not look much like conservation of energy, but it can be rewritten in a more enlightening form as shown in the next example.
Example 39.13
Using our rule (ii) u = ( 1 / V ) d V / d τ u = ( 1 / V ) d V / d τ grad*u=(1//V)dV//dtau\boldsymbol{\nabla} \cdot \boldsymbol{u}=(1 / V) \mathrm{d} V / \mathrm{d} \tauu=(1/V)dV/dτ we write
(39.69) d ρ d τ + ( ρ + p ) V d V d τ = 0 (39.69) d ρ d τ + ( ρ + p ) V d V d τ = 0 {:(39.69)(drho)/((d)tau)+((rho+p))/(V)*((d)V)/((d)tau)=0:}\begin{equation*} \frac{\mathrm{d} \rho}{\mathrm{~d} \tau}+\frac{(\rho+p)}{V} \cdot \frac{\mathrm{~d} V}{\mathrm{~d} \tau}=0 \tag{39.69} \end{equation*}(39.69)dρ dτ+(ρ+p)V dV dτ=0
Then, using d ( n V ) / d τ = 0 d ( n V ) / d τ = 0 d(nV)//dtau=0\mathrm{d}(n V) / \mathrm{d} \tau=0d(nV)/dτ=0 we have
(39.70) d ρ d τ ( ρ + p ) n d n d τ = 0 (39.70) d ρ d τ ( ρ + p ) n d n d τ = 0 {:(39.70)(drho)/((d)tau)-((rho+p))/(n)*((d)n)/((d)tau)=0:}\begin{equation*} \frac{\mathrm{d} \rho}{\mathrm{~d} \tau}-\frac{(\rho+p)}{n} \cdot \frac{\mathrm{~d} n}{\mathrm{~d} \tau}=0 \tag{39.70} \end{equation*}(39.70)dρ dτ(ρ+p)n dn dτ=0
We shall see very shortly that this expression does indeed describe energy conservation.
Let's link the result of the previous example to energy conservation. In relativity, the first law of thermodynamics may be expressed as 23 23 ^(23){ }^{23}23
(39.71) d ( Energy in a volume element with fixed number N of baryons ) = p d V + T d S (39.71) d (  Energy in a volume element   with fixed number  N  of baryons  ) = p d V + T d S {:(39.71)d((" Energy in a volume element ")/(" with fixed number "N" of baryons "))=-pdV+TdS:}\begin{equation*} \mathrm{d}\binom{\text { Energy in a volume element }}{\text { with fixed number } N \text { of baryons }}=-p \mathrm{~d} V+T \mathrm{~d} S \tag{39.71} \end{equation*}(39.71)d( Energy in a volume element  with fixed number N of baryons )=p dV+T dS
We evaluate
(39.72) d ( ρ N n ) = p d ( N n ) + T d ( N s ) (39.72) d ρ N n = p d N n + T d ( N s ) {:(39.72)d((rho N)/(n))=-pd((N)/(n))+Td(Ns):}\begin{equation*} \mathrm{d}\left(\frac{\rho N}{n}\right)=-p \mathrm{~d}\left(\frac{N}{n}\right)+T \mathrm{~d}(N s) \tag{39.72} \end{equation*}(39.72)d(ρNn)=p d(Nn)+T d(Ns)
where V = N / n V = N / n V=N//nV=N / nV=N/n. We note that the factors of N N NNN are common to all terms, so taking the derivatives with respect to n n nnn and rearranging, we obtain the first law of thermodynamics for our relativistic variables as
(39.73) d ρ = p + ρ n d n + n T d s (39.73) d ρ = p + ρ n d n + n T d s {:(39.73)drho=(p+rho)/(n)dn+nTds:}\begin{equation*} \mathrm{d} \rho=\frac{p+\rho}{n} \mathrm{~d} n+n T \mathrm{~d} s \tag{39.73} \end{equation*}(39.73)dρ=p+ρn dn+nT ds
Setting d s = 0 d s = 0 ds=0\mathrm{d} s=0ds=0 and taking derivatives with respect to the proper time τ τ tau\tauτ we obtain eqn 39.70.
IV: Momentum conservation As in the non-relativistic case, this comes from a projection of T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0. This time we project out the part of the divergence perpendicular to the velocity u u u\boldsymbol{u}u using the ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2) projection-operator tensor 24 P 24 P ^(24)P{ }^{24} \boldsymbol{P}24P which has components P α μ = g α μ + u α u μ P α μ = g α μ + u α u μ P_(alpha mu)=g_(alpha mu)+u_(alpha)u_(mu)P_{\alpha \mu}=g_{\alpha \mu}+u_{\alpha} u_{\mu}Pαμ=gαμ+uαuμ
23 23 ^(23){ }^{23}23 Interesting here is that the differential in this expression can be interpreted as the exterior derivative d d d\boldsymbol{d}d. There is no assumption made in the first law about following the flow of the fluid.
24 24 ^(24){ }^{24}24 To find the part of a vector v v v\boldsymbol{v}v perpendicular to a velocity vector u u u\boldsymbol{u}u we write
v ( u v ) ( u u ) u (39.74) = v + ( u v ) u v ( u v ) ( u u ) u (39.74) = v + ( u v ) u {:[v-((u*v))/((u*u))u],[(39.74)=v+(u*v)u]:}\begin{align*} & \boldsymbol{v}-\frac{(\boldsymbol{u} \cdot \boldsymbol{v})}{(\boldsymbol{u} \cdot \boldsymbol{u})} \boldsymbol{u} \\ = & \boldsymbol{v}+(\boldsymbol{u} \cdot \boldsymbol{v}) \boldsymbol{u} \tag{39.74} \end{align*}v(uv)(uu)u(39.74)=v+(uv)u
Comparing the action of P P P\boldsymbol{P}P we find
P α μ v μ = v α + ( u μ v μ ) u α , ( 39.75 ) P α μ v μ = v α + u μ v μ u α , ( 39.75 ) P_(alpha mu)v^(mu)=v_(alpha)+(u_(mu)v^(mu))u_(alpha),quad(39.75)P_{\alpha \mu} v^{\mu}=v_{\alpha}+\left(u_{\mu} v^{\mu}\right) u_{\alpha}, \quad(39.75)Pαμvμ=vα+(uμvμ)uα,(39.75)
which is the down-index version of the simple prescription above
25 25 ^(25){ }^{25}25 This can be seen by expanding the ( 2 , 0 ) ( 2 , 0 ) (2,0)(2,0)(2,0) version of P P P\boldsymbol{P}P in the product P ( p ) P ( p ) P*(grad p)\boldsymbol{P} \cdot(\boldsymbol{\nabla} p)P(p) in components. Explicitly,
( g α β + u α u β ) p x β = g α β p x β + u α ( u p ) g α β + u α u β p x β = g α β p x β + u α ( u p ) {:[(g^(alpha beta)+u^(alpha)u^(beta))(del p)/(delx^(beta))],[=g^(alpha beta)(del p)/(delx^(beta))+u^(alpha)(u*grad p)]:}\begin{aligned} & \left(g^{\alpha \beta}+u^{\alpha} u^{\beta}\right) \frac{\partial p}{\partial x^{\beta}} \\ & =g^{\alpha \beta} \frac{\partial p}{\partial x^{\beta}}+u^{\alpha}(\boldsymbol{u} \cdot \boldsymbol{\nabla} p) \end{aligned}(gαβ+uαuβ)pxβ=gαβpxβ+uα(up)
which is the up version of the righthand side of eqn 39.78 , which can also be written as P ( p P ( p P(grad p\boldsymbol{P}(\boldsymbol{\nabla} pP(p, ) . Y o u c a n ) . Y o u c a n ).Youcan) . You can).Youcan then quickly check that P ( p , u ~ ) = 0 P ( p , u ~ ) = 0 P(grad p, tilde(u))=0\boldsymbol{P}(\boldsymbol{\nabla} p, \tilde{\boldsymbol{u}})=0P(p,u~)=0, which guarantees that u u u = 0 u u u = 0 u*grad_(u)u=0\boldsymbol{u} \cdot \boldsymbol{\nabla}_{u} \boldsymbol{u}=0uuu=0, as usual for an accelerated world line.
26 26 ^(26){ }^{26}26 The non-relativistic Euler equation is
ρ [ u t + ( u ) u ] = p ρ u t + ( u ) u = p rho[(del( vec(u)))/(del t)+(( vec(u))*( vec(grad)))( vec(u))]=- vec(grad)p\rho\left[\frac{\partial \vec{u}}{\partial t}+(\vec{u} \cdot \vec{\nabla}) \vec{u}\right]=-\vec{\nabla} pρ[ut+(u)u]=p
and write
(39.76) P α μ T ; ν μ ν = 0 (39.76) P α μ T ; ν μ ν = 0 {:(39.76)P_(alpha mu)T_(;nu)^(mu nu)=0:}\begin{equation*} P_{\alpha \mu} T_{; \nu}^{\mu \nu}=0 \tag{39.76} \end{equation*}(39.76)PαμT;νμν=0
This expresses momentum conservation and, as we shall see, results in the equation of motion for the fluid: the relativistic analogue of the Euler equation. The equation of motion is given as an equation for the flow lines.

Example 39.14

Applying the projection operator, we obtain
0 = P α μ T μ ν ; ν = 0 + P α μ ( ρ + p ) u μ ; ν u ν + p , ν P α μ (39.77) = ( ρ + p ) u α ; ν u ν + p , α + p , ν u ν u α 0 = P α μ T μ ν ; ν = 0 + P α μ ( ρ + p ) u μ ; ν u ν + p , ν P α μ (39.77) = ( ρ + p ) u α ; ν u ν + p , α + p , ν u ν u α {:[0=P_(alpha mu)T^(mu nu)_(;nu)],[=0+P_(alpha mu)(rho+p)u^(mu)_(;nu)u^(nu)+p_(,nu)P_(alpha mu)],[(39.77)=(rho+p)u_(alpha;nu)u^(nu)+p_(,alpha)+p_(,nu)u^(nu)u_(alpha)]:}\begin{align*} 0 & =P_{\alpha \mu} T^{\mu \nu}{ }_{; \nu} \\ & =0+P_{\alpha \mu}(\rho+p) u^{\mu}{ }_{; \nu} u^{\nu}+p_{, \nu} P_{\alpha \mu} \\ & =(\rho+p) u_{\alpha ; \nu} u^{\nu}+p_{, \alpha}+p_{, \nu} u^{\nu} u_{\alpha} \tag{39.77} \end{align*}0=PαμTμν;ν=0+Pαμ(ρ+p)uμ;νuν+p,νPαμ(39.77)=(ρ+p)uα;νuν+p,α+p,νuνuα
In vector notation, this looks like
(39.78) ( ρ + p ) u u ~ = [ p p + ( u p ) u ~ ] (39.78) ( ρ + p ) u u ~ = p p + u p u ~ {:(39.78)(rho+p)grad_(u) tilde(u)=-[grad^(p)p+(grad_(u)p)( tilde(u))]:}\begin{equation*} (\rho+p) \nabla_{u} \tilde{\boldsymbol{u}}=-\left[\nabla^{p} p+\left(\nabla_{\boldsymbol{u}} p\right) \tilde{\boldsymbol{u}}\right] \tag{39.78} \end{equation*}(39.78)(ρ+p)uu~=[pp+(up)u~]
This may also be written, in vector notation, using the (2,0) version of P , as 25 P , as 25 P,as^(25)\boldsymbol{P}, \mathrm{as}^{25}P,as25
(39.79) ( ρ + p ) u u + P p = 0 . (39.79) ( ρ + p ) u u + P p = 0 . {:(39.79)(rho+p)grad_(u)u+P*grad p=0.:}\begin{equation*} (\rho+p) \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}+\boldsymbol{P} \cdot \boldsymbol{\nabla} p=0 . \tag{39.79} \end{equation*}(39.79)(ρ+p)uu+Pp=0.
The result is that the relativistic Euler equation is written as
(39.80) ( ρ + p ) u u = P p (39.80) ( ρ + p ) u u = P p {:(39.80)(rho+p)grad_(u)u=-P*grad p:}\begin{equation*} (\rho+p) \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}=-\boldsymbol{P} \cdot \boldsymbol{\nabla} p \tag{39.80} \end{equation*}(39.80)(ρ+p)uu=Pp
Comparing this to the non-relativistic version 26 26 ^(26){ }^{26}26 we remember that the non-relativistic limit requires ρ p / c 2 ρ p / c 2 rho≫p//c^(2)\rho \gg p / c^{2}ρp/c2. Our rule (i) for swapping convective for covariant derivatives is seen on the left-hand side.
Important here is a comparison with the geodesic equation u u = u u = grad_(u)u=\nabla_{\boldsymbol{u}} \boldsymbol{u}=uu= 0 . We see from eqn 39.80 how in the relativistic fluid, it is pressure gradients p p grad p\nabla pp perpendicular to the velocity that causes the flow lines to deviate from describing geodesics. A fluid with a uniform pressure simply describes particles that all follow geodesics. A special case of this is the pressureless dust that we often use to describe the cosmological fluid. So, once more with emphasis: dust is a pressureless fluid where all particles follow geodesics.
V V V\mathbf{V}V : Equation of state The thermodynamics of a fluid is described by an equation of state. The nature of the equation of state depends on the circumstances and the resulting natural variables for the problem at hand.
Example 39.15
An example equation of state for the relativistic fluid might be written as
(39.81) ρ = ρ ( n , s ) (39.81) ρ = ρ ( n , s ) {:(39.81)rho=rho(n","s):}\begin{equation*} \rho=\rho(n, s) \tag{39.81} \end{equation*}(39.81)ρ=ρ(n,s)
that is, we specify the total energy density as a function of baryon density and specific entropy. With this assignment, we can use the first law (eqn 39.73) to gives us an expression for pressure in terms of energy. Taking the derivative with respect to n n nnn at constant s s sss leads to
(39.82) p ( n , s ) = n ( ρ n ) s ρ (39.82) p ( n , s ) = n ρ n s ρ {:(39.82)p(n","s)=n((del rho)/(del n))_(s)-rho:}\begin{equation*} p(n, s)=n\left(\frac{\partial \rho}{\partial n}\right)_{s}-\rho \tag{39.82} \end{equation*}(39.82)p(n,s)=n(ρn)sρ
An alternative might we to specify a free energy function a a aaa such as
(39.83) a = a ( n , T ) , (39.83) a = a ( n , T ) , {:(39.83)a=a(n","T)",":}\begin{equation*} a=a(n, T), \tag{39.83} \end{equation*}(39.83)a=a(n,T),
where the natural variables are n n nnn and T T TTT. Again, we can derive an expression for pressure. The pressure (and remember that it is gradients in this quantity that cause flow lines to depart from being geodesics) is generally a first derivative of the free energy. In this case
(39.84) p ( n , T ) = n 2 ( a n ) T (39.84) p ( n , T ) = n 2 a n T {:(39.84)p(n","T)=n^(2)((del a)/(del n))_(T):}\begin{equation*} p(n, T)=n^{2}\left(\frac{\partial a}{\partial n}\right)_{T} \tag{39.84} \end{equation*}(39.84)p(n,T)=n2(an)T
This is the form used in Chapter 40.
This completes our review of fluids. One final comment to make is that for the comoving observer, the natural derivative is the Lie derivative described in Chapter 33. This is examined in the exercises.

Chapter summary

  • Fluid mechanics relies on the velocity field of the fluid or, equivalently its integral curves.
  • Euler's equation expresses the equations of motion of the fluid; Bernoulli's equation describes the conservation of energy. Both of these are contained in the expression T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0.
  • In relativistic fluid mechanics, Euler's equation expresses the deviation from geodesic behaviour caused by gradients in the pressure perpendicular to the local velocity.

Exercises

39.1) Use Bernoulli's theorem to work out how fast fluid flows from a pipe at the bottom of a reservoir. Give your answer in terms of the heights of the top of the fluid y 2 y 2 y_(2)y_{2}y2 and the outlet pipe y 1 y 1 y_(1)y_{1}y1.
(39.2) In special relativity, we write the components of the velocity vector as u μ = ( u 0 , u ) = ( γ , γ v ) u μ = u 0 , u = ( γ , γ v ) u^(mu)=(u^(0),( vec(u)))=(gamma,gamma vec(v))u^{\mu}=\left(u^{0}, \vec{u}\right)=(\gamma, \gamma \vec{v})uμ=(u0,u)=(γ,γv), where γ = ( 1 | v | 2 ) 1 2 γ = 1 | v | 2 1 2 gamma=(1-|( vec(v))|^(2))^(-(1)/(2))\gamma=\left(1-|\vec{v}|^{2}\right)^{-\frac{1}{2}}γ=(1|v|2)12. The flat space, relativistic version of Bernoulli's equation for steady flow
is given by
(39.85) d B d τ = γ v B = 0 (39.85) d B d τ = γ v B = 0 {:(39.85)(dB)/((d)tau)=gamma vec(v)* vec(grad)B=0:}\begin{equation*} \frac{\mathrm{d} B}{\mathrm{~d} \tau}=\gamma \vec{v} \cdot \vec{\nabla} B=0 \tag{39.85} \end{equation*}(39.85)dB dτ=γvB=0
where B = γ ( ρ + p ) / ρ 0 B = γ ( ρ + p ) / ρ 0 B=gamma(rho+p)//rho_(0)B=\gamma(\rho+p) / \rho_{0}B=γ(ρ+p)/ρ0. We shall prove this.
(a) Start by showing that T 0 μ , μ = 0 T 0 μ , μ = 0 T^(0mu)_(,mu)=0T^{0 \mu}{ }_{, \mu}=0T0μ,μ=0 leads to
(39.86) d d τ [ γ ( ρ + p ) ] + ( p + ρ ) γ u = 0 (39.86) d d τ [ γ ( ρ + p ) ] + ( p + ρ ) γ u = 0 {:(39.86)(d)/((d)tau)[gamma(rho+p)]+(p+rho)gamma vec(grad)* vec(u)=0:}\begin{equation*} \frac{\mathrm{d}}{\mathrm{~d} \tau}[\gamma(\rho+p)]+(p+\rho) \gamma \vec{\nabla} \cdot \vec{u}=0 \tag{39.86} \end{equation*}(39.86)d dτ[γ(ρ+p)]+(p+ρ)γu=0
(b) Next, use the conservation of rest mass equation, u = 1 ρ 0 d ρ 0 d τ u = 1 ρ 0 d ρ 0 d τ grad*u=-(1)/(rho_(0))*((d)rho_(0))/((d)tau)\boldsymbol{\nabla} \cdot \boldsymbol{u}=-\frac{1}{\rho_{0}} \cdot \frac{\mathrm{~d} \rho_{0}}{\mathrm{~d} \tau}u=1ρ0 dρ0 dτ, to prove eqn 39.85 .
(39.3) The speed of sound in a static, perfect fluid. Consider a density wave in flat spacetime, defined by
(39.87) n = n + δ n , ρ = ρ + δ ρ , p = p + δ p , s = s + δ s , and v = δ v , (39.87) n = n + δ n , ρ = ρ + δ ρ , p = p + δ p , s = s + δ s ,  and  v = δ v , {:(39.87){:[n=n_(**)+delta n",",,rho=rho_(**)+delta rho","],[p=,p_(**)+delta p",",s=s_(**)+delta s","],[," and ",, vec(v)=delta vec(v)","]:}:}\begin{array}{rlrl} n=n_{*}+\delta n, & & \rho=\rho_{*}+\delta \rho, \\ p= & p_{*}+\delta p, & s=s_{*}+\delta s, \tag{39.87}\\ & \text { and } & & \vec{v}=\delta \vec{v}, \end{array}(39.87)n=n+δn,ρ=ρ+δρ,p=p+δp,s=s+δs, and v=δv,
where the subscript * denotes the constant, equilibrium values.
(a) Show that the baryon conservation law yields
(39.88) t δ n + n δ v = 0 (39.88) t δ n + n δ v = 0 {:(39.88)(del)/(del t)delta n+n_(**) vec(grad)*delta vec(v)=0:}\begin{equation*} \frac{\partial}{\partial t} \delta n+n_{*} \vec{\nabla} \cdot \delta \vec{v}=0 \tag{39.88} \end{equation*}(39.88)tδn+nδv=0
(b) Show further that
(39.89) ( ρ + p ) t δ v + δ p = 0 (39.89) ρ + p t δ v + δ p = 0 {:(39.89)(rho_(**)+p_(**))(del)/(del t)delta vec(v)+ vec(grad)delta p=0:}\begin{equation*} \left(\rho_{*}+p_{*}\right) \frac{\partial}{\partial t} \delta \vec{v}+\vec{\nabla} \delta p=0 \tag{39.89} \end{equation*}(39.89)(ρ+p)tδv+δp=0
(c) Explain why we must also have
(39.90) n δ ρ = ( ρ + p ) δ n (39.90) n δ ρ = ρ + p δ n {:(39.90)n_(**)delta rho=(rho_(**)+p_(**))delta n:}\begin{equation*} n_{*} \delta \rho=\left(\rho_{*}+p_{*}\right) \delta n \tag{39.90} \end{equation*}(39.90)nδρ=(ρ+p)δn
We define the speed of sound as
(39.91) c s 2 = ( p ρ ) s (39.91) c s 2 = p ρ s {:(39.91)c_(s)^(2)=((del p)/(del rho))_(s):}\begin{equation*} c_{\mathrm{s}}^{2}=\left(\frac{\partial p}{\partial \rho}\right)_{s} \tag{39.91} \end{equation*}(39.91)cs2=(pρ)s
(d) Use the expressions above to show that the equation of motion for δ n δ n delta n\delta nδn is the wave equation
(39.92) 1 c s 2 2 t 2 δ n = 2 δ n (39.92) 1 c s 2 2 t 2 δ n = 2 δ n {:(39.92)(1)/(c_(s)^(2))(del^(2))/(delt^(2))delta n= vec(grad)^(2)delta n:}\begin{equation*} \frac{1}{c_{\mathrm{s}}^{2}} \frac{\partial^{2}}{\partial t^{2}} \delta n=\vec{\nabla}^{2} \delta n \tag{39.92} \end{equation*}(39.92)1cs22t2δn=2δn
Hint: The velocity has components u μ = ( 1 , 0 ) u μ = ( 1 , 0 ) u^(mu)=(1,0)u^{\mu}=(1,0)uμ=(1,0) and ( δ u ) μ = ( γ , γ δ v ) ( δ u ) μ = ( γ , γ δ v ) (delta u)^(mu)=(gamma,gamma delta vec(v))(\delta \boldsymbol{u})^{\mu}=(\gamma, \gamma \delta \vec{v})(δu)μ=(γ,γδv). However, since δ v δ v delta vec(v)\delta \vec{v}δv is small, we have γ 1 γ 1 gamma~~1\gamma \approx 1γ1 and so ( δ u ) μ = ( 1 , δ v ) ( δ u ) μ = ( 1 , δ v ) (delta u)^(mu)=(1,delta vec(v))(\delta \boldsymbol{u})^{\mu}=(1, \delta \vec{v})(δu)μ=(1,δv).
(39.4) Define the mass current of point particles as
(39.93) J m α = d τ m g d z α d τ δ ( 4 ) [ x z ( τ ) ] (39.93) J m α = d τ m g d z α d τ δ ( 4 ) [ x z ( τ ) ] {:(39.93)J_(m)^(alpha)=intdtau(m)/(sqrt(-g))((d)z^(alpha))/(dtau)delta^((4))[x-z(tau)]:}\begin{equation*} J_{\mathrm{m}}^{\alpha}=\int \mathrm{d} \tau \frac{m}{\sqrt{-g}} \frac{\mathrm{~d} z^{\alpha}}{\mathrm{d} \tau} \delta^{(4)}[x-z(\tau)] \tag{39.93} \end{equation*}(39.93)Jmα=dτmg dzαdτδ(4)[xz(τ)]
Use the useful identity μ A μ = 1 g x μ ( g A μ ) μ A μ = 1 g x μ g A μ grad_(mu)A^(mu)=(1)/(sqrt(-g))(del)/(delx^(mu))(sqrt(-g)A^(mu))\nabla_{\mu} A^{\mu}=\frac{1}{\sqrt{-g}} \frac{\partial}{\partial x^{\mu}}\left(\sqrt{-g} A^{\mu}\right)μAμ=1gxμ(gAμ) (proved in Exercise 34.3) to show that the mass current is divergenceless.
Hint: Use the distributional identity
(39.94) u α α δ ( 4 ) [ x z ( τ ) ] = d d τ δ ( 4 ) [ x z ( τ ) ] (39.94) u α α δ ( 4 ) [ x z ( τ ) ] = d d τ δ ( 4 ) [ x z ( τ ) ] {:(39.94)u^(alpha)del_(alpha)delta^((4))[x-z(tau)]=-(d)/((d)tau)delta^((4))[x-z(tau)]:}\begin{equation*} u^{\alpha} \partial_{\alpha} \delta^{(4)}[x-z(\tau)]=-\frac{\mathrm{d}}{\mathrm{~d} \tau} \delta^{(4)}[x-z(\tau)] \tag{39.94} \end{equation*}(39.94)uααδ(4)[xz(τ)]=d dτδ(4)[xz(τ)]
(39.5) Using the connection coefficients for the Schwarzschild geometry given in Chapter 21, show that T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0 implies that, for a perfect fluid,
(39.95) p r + ( p + ρ ) Φ r = 0 (39.95) p r + ( p + ρ ) Φ r = 0 {:(39.95)(del p)/(del r)+(p+rho)(del Phi)/(del r)=0:}\begin{equation*} \frac{\partial p}{\partial r}+(p+\rho) \frac{\partial \Phi}{\partial r}=0 \tag{39.95} \end{equation*}(39.95)pr+(p+ρ)Φr=0
Hint: Remember to represent the components of T T T\boldsymbol{T}T in spherical polar coordinates. These can be transformed from the orthonormal-frame version using the appropriate vielbein.
(39.6) In a non-relativistic fluid, conservation of mass is expressed as
(39.96) ρ 0 t + ( ρ 0 v ) = 0 (39.96) ρ 0 t + ρ 0 v = 0 {:(39.96)(delrho_(0))/(del t)+ vec(grad)*(rho_(0)( vec(v)))=0:}\begin{equation*} \frac{\partial \rho_{0}}{\partial t}+\vec{\nabla} \cdot\left(\rho_{0} \vec{v}\right)=0 \tag{39.96} \end{equation*}(39.96)ρ0t+(ρ0v)=0
The comoving observer is carried along by the fluid's 3 -velocity field v v vec(v)\vec{v}v and measures the change in the mass field ρ 0 V ρ 0 V rho_(0)V\rho_{0} Vρ0V. We shall upgrade this into the 3 -volume V V VVV into the language of geometry. We first define a volume element 3 -form ω ~ = d x d y d z ω ~ = d x d y d z tilde(omega)=dx^^dy^^dz\tilde{\boldsymbol{\omega}}=\boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} zω~=dxdydz.
(a) Let's find the Lie derivative £ v ( ρ 0 ω ~ ) £ v ρ 0 ω ~ £_( vec(v))(rho_(0)( tilde(omega)))£_{\vec{v}}\left(\rho_{0} \tilde{\boldsymbol{\omega}}\right)£v(ρ0ω~). Show, using components, that
(39.97) £ v ( ρ 0 ω ~ ) = [ ( ρ 0 v ) ] ω ~ . (39.97) £ v ρ 0 ω ~ = ρ 0 v ω ~ . {:(39.97)£_( vec(v))(rho_(0)( tilde(omega)))=[( vec(grad))*(rho_(0)( vec(v)))] tilde(omega).:}\begin{equation*} £_{\vec{v}}\left(\rho_{0} \tilde{\boldsymbol{\omega}}\right)=\left[\vec{\nabla} \cdot\left(\rho_{0} \vec{v}\right)\right] \tilde{\boldsymbol{\omega}} . \tag{39.97} \end{equation*}(39.97)£v(ρ0ω~)=[(ρ0v)]ω~.
(b) For the Lie derivatives of p p ppp-forms, there is a useful rule that says
(39.98) £ v ω ~ = d ω ~ , v + d ω ~ , v (39.98) £ v ω ~ = d ω ~ , v + d ω ~ , v {:(39.98)£_(v) tilde(omega)=d(: tilde(omega)","v:)+(:d tilde(omega)","v:):}\begin{equation*} £_{v} \tilde{\omega}=\boldsymbol{d}\langle\tilde{\omega}, \boldsymbol{v}\rangle+\langle\boldsymbol{d} \tilde{\omega}, v\rangle \tag{39.98} \end{equation*}(39.98)£vω~=dω~,v+dω~,v
where v v v\boldsymbol{v}v is a vector field. [This is proved in Schutz (1980), whose discussion we follow here.] Use this rule to confirm the result in part (a).
(c) Comparing the result proved in the first parts of the question to the conservation of mass equation equation, show that
(39.99) ( t + £ v ) ( ρ 0 ω ~ ) = 0 (39.99) t + £ v ρ 0 ω ~ = 0 {:(39.99)((del)/(del t)+£_( vec(v)))(rho_(0)( tilde(omega)))=0:}\begin{equation*} \left(\frac{\partial}{\partial t}+£_{\vec{v}}\right)\left(\rho_{0} \tilde{\boldsymbol{\omega}}\right)=0 \tag{39.99} \end{equation*}(39.99)(t+£v)(ρ0ω~)=0
See Schutz (1980) for further details.
(39.7) We can write the non-relativistic Euler equation in flat space as
t v i + v j x j v i + 1 ρ 0 x i p + x i Φ = 0 t v i + v j x j v i + 1 ρ 0 x i p + x i Φ = 0 (del)/(del t)v_(i)+v^(j)(del)/(delx^(j))v_(i)+(1)/(rho_(0))(del)/(delx^(i))p+(del)/(delx^(i))Phi=0\frac{\partial}{\partial t} v_{i}+v^{j} \frac{\partial}{\partial x^{j}} v_{i}+\frac{1}{\rho_{0}} \frac{\partial}{\partial x^{i}} p+\frac{\partial}{\partial x^{i}} \Phi=0tvi+vjxjvi+1ρ0xip+xiΦ=0.
Show that, with v ~ = v i d x i v ~ = v i d x i tilde(v)=v_(i)dx^(i)\tilde{\boldsymbol{v}}=v_{i} \mathrm{~d} x^{i}v~=vi dxi, this can be rewritten in terms of Lie and exterior derivatives as
( t + £ v ) v ~ + 1 ρ 0 d p + d ( Φ v 2 / 2 ) = 0 t + £ v v ~ + 1 ρ 0 d p + d Φ v 2 / 2 = 0 ((del)/(del t)+£_( vec(v))) tilde(v)+(1)/(rho_(0))dp+d(Phi-v^(2)//2)=0\left(\frac{\partial}{\partial t}+£_{\vec{v}}\right) \tilde{\boldsymbol{v}}+\frac{1}{\rho_{0}} \boldsymbol{d} p+\boldsymbol{d}\left(\Phi-v^{2} / 2\right)=0(t+£v)v~+1ρ0dp+d(Φv2/2)=0
(39.8) If v v vec(v)\vec{v}v is a uniform velocity field for a fluid in Euclidean space, find a 2 -form f ~ f ~ tilde(f)\tilde{\boldsymbol{f}}f~ that outputs the flux of the fluid through a parallelogram formed by two displacements a a vec(a)\vec{a}a and b b vec(b)\vec{b}b.
(39.9) We can use fluid mechanics to compute the behaviour of the cosmological fluid in the Einstein-de Sitter universe (Universe 3 in Chapter 18), whose dynamics we can assume is Newtonian.
(a) Show that for a Hubble-law expansion with v = ( a ˙ / a ) r v = ( a ˙ / a ) r vec(v)=(a^(˙)//a) vec(r)\vec{v}=(\dot{a} / a) \vec{r}v=(a˙/a)r we expect a non-relativistic Euler equation of
Φ = v t + ( v ) v = ( a ¨ a ) r Φ = v t + ( v ) v = ( a ¨ a ) r - vec(grad)Phi=(del( vec(v)))/(del t)+( vec(v)* vec(grad)) vec(v)=(((a^(¨)))/(a)) vec(r)-\vec{\nabla} \Phi=\frac{\partial \vec{v}}{\partial t}+(\vec{v} \cdot \vec{\nabla}) \vec{v}=\binom{\ddot{a}}{a} \vec{r}Φ=vt+(v)v=(a¨a)r
(39.102)

where Φ Φ Phi\PhiΦ is the potential.

(b) Show that this is consistent with (i) the Friedmann equations, and (ii) a Poisson equation with potential Φ = 2 π ρ r 2 / 3 Φ = 2 π ρ r 2 / 3 Phi=2pi rhor^(2)//3\Phi=2 \pi \rho r^{2} / 3Φ=2πρr2/3.
Now consider a perturbation
(39.103) ρ ρ + δ ρ , v v + δ v Φ Φ + δ Φ (39.103) ρ ρ + δ ρ , v v + δ v Φ Φ + δ Φ {:[(39.103)rho rarr rho+delta rho","quad vec(v)rarr vec(v)+delta vec(v)],[Phi rarr Phi+delta Phi]:}\begin{align*} \rho & \rightarrow \rho+\delta \rho, \quad \vec{v} \rightarrow \vec{v}+\delta \vec{v} \tag{39.103}\\ \Phi & \rightarrow \Phi+\delta \Phi \end{align*}(39.103)ρρ+δρ,vv+δvΦΦ+δΦ
(c) Show that the linearized mass-continuity equation becomes
(39.104) 0 = [ t + ( v ) ] δ ρ ρ + δ v (39.104) 0 = t + ( v ) δ ρ ρ + δ v {:(39.104)0=[(del)/(del t)+(( vec(v))*( vec(grad)))](delta rho)/(rho)+ vec(grad)*delta vec(v):}\begin{equation*} 0=\left[\frac{\partial}{\partial t}+(\vec{v} \cdot \vec{\nabla})\right] \frac{\delta \rho}{\rho}+\vec{\nabla} \cdot \delta \vec{v} \tag{39.104} \end{equation*}(39.104)0=[t+(v)]δρρ+δv
(d) Show that the linearized equation of motion becomes
(39.105) 0 = [ t + ( v ) ] δ v + a ˙ a δ v = δ Φ (39.105) 0 = t + ( v ) δ v + a ˙ a δ v = δ Φ {:(39.105)0=[(del)/(del t)+(( vec(v))*( vec(grad)))]delta vec(v)+((a^(˙)))/(a)delta vec(v)=- vec(grad)delta Phi:}\begin{equation*} 0=\left[\frac{\partial}{\partial t}+(\vec{v} \cdot \vec{\nabla})\right] \delta \vec{v}+\frac{\dot{a}}{a} \delta \vec{v}=-\vec{\nabla} \delta \Phi \tag{39.105} \end{equation*}(39.105)0=[t+(v)]δv+a˙aδv=δΦ
(e) We can explicitly make a change to comoving coordinates with the transformation
(39.106) t = t , x i = a ( t ) x i (39.106) t = t , x i = a t x i {:(39.106)t=t^(')","quadx^(i)=a(t^('))x^(i^(')):}\begin{equation*} t=t^{\prime}, \quad x^{i}=a\left(t^{\prime}\right) x^{i^{\prime}} \tag{39.106} \end{equation*}(39.106)t=t,xi=a(t)xi
Show that this is consistent with the expected rule
(39.107) t + v = t (39.107) t + v = t {:(39.107)(del)/(del t)+ vec(v)* vec(grad)=(del)/(delt^(')):}\begin{equation*} \frac{\partial}{\partial t}+\vec{v} \cdot \vec{\nabla}=\frac{\partial}{\partial t^{\prime}} \tag{39.107} \end{equation*}(39.107)t+v=t
(f) In terms of these new coordinates, verify that eqns 39.104 and 39.105 can be combined to give
(39.108) [ 2 t 2 + 2 a ˙ a t ] δ ρ ρ = 1 a 2 2 δ Φ (39.108) 2 t 2 + 2 a ˙ a t δ ρ ρ = 1 a 2 2 δ Φ {:(39.108)[(del^(2))/(delt^('2))+(2(a^(˙)))/(a)(del)/(delt^('))](delta rho)/(rho)=(1)/(a^(2)) vec(grad)^('2)delta Phi:}\begin{equation*} \left[\frac{\partial^{2}}{\partial t^{\prime 2}}+\frac{2 \dot{a}}{a} \frac{\partial}{\partial t^{\prime}}\right] \frac{\delta \rho}{\rho}=\frac{1}{a^{2}} \vec{\nabla}^{\prime 2} \delta \Phi \tag{39.108} \end{equation*}(39.108)[2t2+2a˙at]δρρ=1a22δΦ
(g) By invoking the properties of the Einstein-de Sitter universe, show that this in turn reduces to
(39.109) [ 2 t 2 + 4 3 t t 2 3 t 2 ] δ ρ ρ = 0 (39.109) 2 t 2 + 4 3 t t 2 3 t 2 δ ρ ρ = 0 {:(39.109)[(del^(2))/(delt^('2))+(4)/(3t^('))(del)/(delt^('))-(2)/(3t^('2))](delta rho)/(rho)=0:}\begin{equation*} \left[\frac{\partial^{2}}{\partial t^{\prime 2}}+\frac{4}{3 t^{\prime}} \frac{\partial}{\partial t^{\prime}}-\frac{2}{3 t^{\prime 2}}\right] \frac{\delta \rho}{\rho}=0 \tag{39.109} \end{equation*}(39.109)[2t2+43tt23t2]δρρ=0
(h) Demonstrate that the equation has two solutions: one that decays as 1 / t 1 / t 1//t^(')1 / t^{\prime}1/t and another that grows as t 2 / 3 t 2 / 3 t^('2//3)t^{\prime 2 / 3}t2/3.
This is important as it shows that any perturbation to this universe either decays or grows very slowly.

40

40.1 Matter fields 429
40.2 Action and equations of mo-
tion 430
40.3 Fields in curved spacetime
433
40.4 Motivating the Einstein
equation 434
40.5 Energy-momentum tensor
437
40.6 Noether's theorem 438
40.7 The perfect fluid 440
Chapter summary 443
Exercises 443
40.1 Matter fields 429 40.2 Action and equations of mo- tion 430 40.3 Fields in curved spacetime 433 40.4 Motivating the Einstein equation 434 40.5 Energy-momentum tensor 437 40.6 Noether's theorem 438 40.7 The perfect fluid 440 Chapter summary 443 Exercises 443 | 40.1 | Matter fields | 429 | | :--- | :--- | ---: | | 40.2 | Action and equations of mo- | | | tion | 430 | | | 40.3 | Fields in curved | spacetime | | | | 433 | | 40.4 | Motivating the | Einstein | | | equation | 434 | | 40.5 | Energy-momentum | tensor | | | | 437 | | 40.6 | Noether's theorem | 438 | | 40.7 | The perfect fluid | 440 | | Chapter summary | 443 | | | Exercises | 443 | |
\curvearrowright The material in this chapter is useful for understanding inflation (Chapter 41) and electromagnetism (Chapter 42). It also forms the basis for string theory (Chapter 49).
1 1 ^(1){ }^{1}1 David Hilbert (1862-1943) was one of the most influential mathematicians of the nineteenth and twentieth centuries. An example of his influence was the collection of mathematical problems, presented in 1900, that largely set the agenda for research in the twentieth century. The Einstein-Hilbert action was proposed by Hilbert in 1915.
2 2 ^(2){ }^{2}2 Einstein developed general relativity between 1907 and 1915. He arrived at his field equations in late 1915. The history is presented in Pais' Subtle is the Lord (2005), and also in Cheng's Einstein's Physics (2013). As pointed out by Ohanian and Ruffini, Einstein gave us general relativity early: we really only deserved it around twenty years later when the relativistic gauge-field techniques discussed in this part of the book had matured.

Lagrangian field theory

The reader will find no figures in this work. The methods which I set forth do not require either constructions or geometrical or mechanical reasonings: but only algebraic operations, subject to a regular and uniform rule of procedure. Those who love Analysis will see with pleasure that Mechanics has become a branch of it, and will be grateful to me for having thus extended its domain.
Joseph Lagrange (1736-1813) Mécanique Analytique
A classical field can be thought of as a machine whose input is a position x x xxx in spacetime and whose output is the amplitude of the field at that point. General relativity is a classical field theory and so, like all field theories, we should therefore be able to deal with it using the machinery of Lagrangians. This approach has many advantages, not least that it represents a systematic means of deriving information about the fields and their interactions. Previously, for example, we have had to make educated guesses about the shape and content of the energymomentum tensor T T T\boldsymbol{T}T. This is rather unsatisfactory given the importance of T T T\boldsymbol{T}T in supplying the right-hand side of the Einstein field equation. In contrast to this ad-hoc approach, a Lagrangian field theory has an inbuilt, turn-the-handle recipe for deriving the energy-momentum tensor. We need only supply the matter fields and the Lagrangian-field-theory machine outputs a suitable field T ( x ) T ( x ) T(x)\boldsymbol{T}(x)T(x) containing all of the information about the energy content of the fields we have inputted. In this chapter, we rederive the Einstein equation systematically by following the Lagrangian method. When general relativity was being formulated, David Hilbert 1 1 ^(1){ }^{1}1 searched for a Lagrangian formulation that would describe gravitating systems. He found it (and it is now named in his honour) but only after Einstein had found his field equations via a less systematic method, narrowly beating Hilbert to the solution. 2 2 ^(2){ }^{2}2
Before starting, it is worth a reminder of the fields with which we are concerned. General relativity couples the geometry of spacetime to the matter fields of the Universe. One class of fields is therefore the matter fields that describes the distribution of both massive and massless matter throughout spacetime. (This class of matter fields also include the 1 -form field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x) that describes electromagnetism.) These matter fields go together to form T ( x ) T ( x ) T(x)\boldsymbol{T}(x)T(x). The other class has a single member: it is the field that governs the curvature of spacetime. This is the ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2)
metric field g ( x ) g ( x ) g(x)\boldsymbol{g}(x)g(x). A Lagrangian field theory version of general relativity must couple these fields.
Let's now build a general Lagrangian formulation of classical field theory. We do this in such a way as to maximize the resemblance of field theory to the Lagrangian formulation of classical particle mechanics, that we have already met. Our procedure will involve writing down a Lagrangian that describes the metric field, the matter fields and their interactions. The Lagrangian can then be fed through a set of EulerLagrange equations to output a generalized form of the Einstein field equations.

40.1 Matter fields

Every particle in the Universe is an excitation of a matter field. This statement is the basis of quantum field theory (QFT), which describes the properties of matter fields on a quantum level. It does this by taking a classical description of the matter fields and quantizing it. That is, it turns the fields into quantum-mechanical operators that have commutation relations with each other. The quantum-field operators act on quantum states, creating or annihilating excitations. 3 3 ^(3){ }^{3}3
General relativity deals with classical fields, without the step of quantization described above. However, we still start by following the same route as is followed in QFT, where we write down a description of the matter in the Universe in terms of classical matter fields. We might wonder what the fields represent. We've said that a field can be thought of as a machine that inputs a position in spacetime and outputs an amplitude; in a classical field theory this amplitude typically describes the mass-energy density of matter.
Example 40.1
We shall meet several kinds of classical field in this chapter.
  • A scalar field inputs a position in spacetime x x xxx and outputs a scalar amplitude that describes the distribution of matter. Specifically, the real scalar field ϕ ( x ) ϕ ( x ) phi(x)\phi(x)ϕ(x) gives the probability amplitude for finding matter at point x x vec(x)\vec{x}x at a time t t ttt, such that the probability of finding a particle in a region d Σ d Σ dSigma\mathrm{d} \SigmadΣ centred on x x vec(x)\vec{x}x is ϕ ( x ) 2 d Σ ϕ ( x ) 2 d Σ phi(x)^(2)dSigma\phi(x)^{2} \mathrm{~d} \Sigmaϕ(x)2 dΣ, where d Σ d Σ dSigma\mathrm{d} \SigmadΣ probability of finding a particle in a region d Σ d Σ dSigma\mathrm{d} \SigmadΣ centred on x x vec(x)\vec{x}x is ϕ ( x ) 2 d Σ ϕ ( x ) 2 d Σ phi(x)^(2)dSigma\phi(x)^{2} \mathrm{~d} \Sigmaϕ(x)2 dΣ, where d Σ d Σ dSigma\mathrm{d} \SigmadΣ
    is an element of 3 -volume. This field is quantized in QFT to describe excitations in is an element of 3 -volume. This field is quantiz
    the field, which are massive, spinless particles.
    the field, which are massive, spinless particles.
  • A vector field or 1 -form field inputs a position in spacetime and outputs a vector-- A vector field or 1 1 1\mathbf{1}1-form field inputs a position in spacetime and outputs a vector-
    valued quantity or 1 -form valued quantity, respectively. An important example is the electromagnetic 1-form gauge field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x) that can be used to determine the electric and magnetic fields at a point in spacetime. It is quantized in QFT into photons, the massless particle excitations of light.
  • In general relativity, we most often deal with the mass density field ρ ( x ) ρ ( x ) rho(x)\rho(x)ρ(x) for a perfect fluid. This tells us the density of fluid at a point x x xxx in spacetime, such that the mass of fluid in a volume d Σ d Σ dSigma\mathrm{d} \SigmadΣ centred at x x vec(x)\vec{x}x at a particular time is ρ ( x ) d Σ ρ ( x ) d Σ rho(x)dSigma\rho(x) \mathrm{d} \Sigmaρ(x)dΣ. This is not a field that features in most standard treatments of QFT.
Knowing a field locally (that is, at our position in spacetime x x xxx ) gives us access to the distribution of matter described by the field close to us (that
3 3 ^(3){ }^{3}3 See Lancaster and Blundell, Quantum Field Theory for the Gifted Amateur (2014), for a full account.

\curvearrowright Chapter 49 contains a discussion of approaches to quan-

tum gravity.

4 4 ^(4){ }^{4}4 Some often-used notation is that, in flat space, partial derivatives with respect to a coordinate x μ x μ x^(mu)x^{\mu}xμ are often written as
(40.4) μ ϕ ϕ x μ (40.4) μ ϕ ϕ x μ {:(40.4)del_(mu)phi-=(del phi)/(delx^(mu)):}\begin{equation*} \partial_{\mu} \phi \equiv \frac{\partial \phi}{\partial x^{\mu}} \tag{40.4} \end{equation*}(40.4)μϕϕxμ
In comma notation, we would write μ ϕ ϕ , μ μ ϕ ϕ , μ del_(mu)phi-=phi_(,mu)\partial_{\mu} \phi \equiv \phi_{, \mu}μϕϕ,μ. The advantage of this latter notation is that in curved space, commas can be upgraded to semicolons to signify a covariant partial derivative.
5 5 ^(5){ }^{5}5 Lagrangian density is usually shortened to 'Lagrangian' to save effort, which we'll do from here.
is, the fields are local). If we also have the field's equation of motion, we shall be able to make predictions about the behaviour of the field at other points in spacetime, and the consequences of its interactions with other fields. The first step towards finding this equation is to write down the action that describes the system, and it is to this we now turn.

40.2 Action and equations of motion

We have met the action of a system in particle mechanics. Mathematically the action is a functional S [ ] S [ ] S[]S[]S[] : a machine that inputs a function ϕ ( x ) ϕ ( x ) phi(x)\phi(x)ϕ(x) (or possibly several functions), and outputs a number S [ ϕ ] S [ ϕ ] S[phi]S[\phi]S[ϕ] with units of (energy) × × xx\times× (time). Classical particle mechanics involves writing the action S S SSS of a system in terms of the coordinates q i q i q_(i)q_{i}qi of the particles involved and their time derivatives q ˙ i q ˙ i q^(˙)_(i)\dot{q}_{i}q˙i. The action is found using the Lagrangian L ( q i , q ˙ i ) L q i , q ˙ i L(q_(i),q^(˙)_(i))L\left(q_{i}, \dot{q}_{i}\right)L(qi,q˙i) which is a function that we integrate with respect to time to obtain the action itself. Specifically, for N N NNN particles we have
(40.1) S [ q 1 , q 2 , , q N ] = d t L ( q 1 , q 2 , , q N , q ˙ 1 , q ˙ 2 , , q ˙ N ) (40.1) S q 1 , q 2 , , q N = d t L q 1 , q 2 , , q N , q ˙ 1 , q ˙ 2 , , q ˙ N {:(40.1)S[q_(1),q_(2),dots,q_(N)]=intdtL(q_(1),q_(2),dots,q_(N),q^(˙)_(1),q^(˙)_(2),dots,q^(˙)_(N)):}\begin{equation*} S\left[q_{1}, q_{2}, \ldots, q_{N}\right]=\int \mathrm{d} t L\left(q_{1}, q_{2}, \ldots, q_{N}, \dot{q}_{1}, \dot{q}_{2}, \ldots, \dot{q}_{N}\right) \tag{40.1} \end{equation*}(40.1)S[q1,q2,,qN]=dtL(q1,q2,,qN,q˙1,q˙2,,q˙N)
In particle mechanics, the action is simply the difference between the kinetic and potential energies of the system.
We now start upgrading this scheme to work on fields. In flat, threedimensional space, the Lagrangian is related to the Lagrangian density L L L\mathcal{L}L by
(40.2) L = d 3 x L (40.2) L = d 3 x L {:(40.2)L=intd^(3)xL:}\begin{equation*} L=\int \mathrm{d}^{3} x \mathcal{L} \tag{40.2} \end{equation*}(40.2)L=d3xL
so that
(40.3) S = d 4 x L (40.3) S = d 4 x L {:(40.3)S=intd^(4)xL:}\begin{equation*} S=\int \mathrm{d}^{4} x \mathcal{L} \tag{40.3} \end{equation*}(40.3)S=d4xL
The Lagrangian is so important that we often use a shorthand and call a Lagrangian or action that describes fields and their interaction a field theory.
Classical field theory involves writing the action S S SSS of a system in terms of the matter field ϕ ϕ phi\phiϕ involved and its derivatives with respect to space and time. 4 4 ^(4){ }^{4}4 This allows us to identify a Lagrangian density for the matter fields L m ( ϕ , μ ϕ ) L m ϕ , μ ϕ L_(m)(phi,del_(mu)phi)\mathcal{L}_{\mathrm{m}}\left(\phi, \partial_{\mu} \phi\right)Lm(ϕ,μϕ) : a function of the field and its derivatives that we integrate over all space and time to obtain the action. In flat space, we have, therefore, that
(40.5) S [ ϕ ] = d 4 x L m ( ϕ , μ ϕ ) (40.5) S [ ϕ ] = d 4 x L m ϕ , μ ϕ {:(40.5)S[phi]=intd^(4)xL_(m)(phi,del_(mu)phi):}\begin{equation*} S[\phi]=\int \mathrm{d}^{4} x \mathcal{L}_{\mathrm{m}}\left(\phi, \partial_{\mu} \phi\right) \tag{40.5} \end{equation*}(40.5)S[ϕ]=d4xLm(ϕ,μϕ)
The Lagrangian density of matter fields is analogous to the Lagrangian in particle mechanics, it reflects the difference in kinetic and potential energy densities of a field. In order to describe the matter fields in the Universe, we need to be able to write their Lagrangian densities in terms of the fields and their derivatives. 5 5 ^(5){ }^{5}5

Example 40.2

Consider waves on a string of mass m m mmm and length \ell. We define the mass density ρ = m / ρ = m / rho=m//ℓ\rho=m / \ellρ=m/, tension T T T\mathcal{T}T and displacement ψ ( x , t ) ψ ( x , t ) psi(x,t)\psi(x, t)ψ(x,t) from the equilibrium (see Fig. 40.1). The kinetic energy T T TTT can then be written as T = 1 2 0 d x ρ ( ψ / t ) 2 T = 1 2 0 d x ρ ( ψ / t ) 2 T=(1)/(2)int_(0)^(ℓ)dx rho(del psi//del t)^(2)T=\frac{1}{2} \int_{0}^{\ell} \mathrm{d} x \rho(\partial \psi / \partial t)^{2}T=120dxρ(ψ/t)2 and the potential energy V = 1 2 0 d x T ( ψ / x ) 2 V = 1 2 0 d x T ( ψ / x ) 2 V=(1)/(2)int_(0)^(ℓ)dxT(del psi//del x)^(2)V=\frac{1}{2} \int_{0}^{\ell} \mathrm{d} x \mathcal{T}(\partial \psi / \partial x)^{2}V=120dxT(ψ/x)2. The action is then
(40.6) S [ ψ ( x , t ) ] = d t ( T V ) = d t d x L m ( ψ , ψ t , ψ x ) (40.7) where L m ( ψ , ψ t , ψ x ) = ρ 2 ( ψ t ) 2 T 2 ( ψ x ) 2 is the Lagrangian density. (40.6) S [ ψ ( x , t ) ] = d t ( T V ) = d t d x L m ψ , ψ t , ψ x (40.7)  where  L m ψ , ψ t , ψ x = ρ 2 ψ t 2 T 2 ψ x 2  is the Lagrangian density.  {:[(40.6)qquad S[psi(x","t)]=intdt(T-V)=intdtdxL_(m)(psi,(del psi)/(del t),(del psi)/(del x))],[(40.7)" where "],[qquadL_(m)(psi,(del psi)/(del t),(del psi)/(del x))=(rho)/(2)((del psi)/(del t))^(2)-(T)/(2)((del psi)/(del x))^(2)],[" is the Lagrangian density. "]:}\begin{align*} & \qquad S[\psi(x, t)]=\int \mathrm{d} t(T-V)=\int \mathrm{d} t \mathrm{~d} x \mathcal{L}_{\mathrm{m}}\left(\psi, \frac{\partial \psi}{\partial t}, \frac{\partial \psi}{\partial x}\right) \tag{40.6}\\ & \text { where } \tag{40.7}\\ & \qquad \mathcal{L}_{\mathrm{m}}\left(\psi, \frac{\partial \psi}{\partial t}, \frac{\partial \psi}{\partial x}\right)=\frac{\rho}{2}\left(\frac{\partial \psi}{\partial t}\right)^{2}-\frac{\mathcal{T}}{2}\left(\frac{\partial \psi}{\partial x}\right)^{2} \\ & \text { is the Lagrangian density. } \end{align*}(40.6)S[ψ(x,t)]=dt(TV)=dt dxLm(ψ,ψt,ψx)(40.7) where Lm(ψ,ψt,ψx)=ρ2(ψt)2T2(ψx)2 is the Lagrangian density. 
In most cases we assume a form for the Lagrangian such as those discussed in the next example.
Example 40.3
A free scalar field has the Lagrangian
(40.8) Lagrangian L m = 1 2 ( μ ϕ ) 2 m 2 ϕ 2 2 (40.8)  Lagrangian  L m = 1 2 μ ϕ 2 m 2 ϕ 2 2 {:[(40.8)" Lagrangian "],[L_(m)=-(1)/(2)(del_(mu)phi)^(2)-(m^(2)phi^(2))/(2)]:}\begin{align*} & \text { Lagrangian } \tag{40.8}\\ & \mathcal{L}_{\mathrm{m}}=-\frac{1}{2}\left(\partial_{\mu} \phi\right)^{2}-\frac{m^{2} \phi^{2}}{2} \end{align*}(40.8) Lagrangian Lm=12(μϕ)2m2ϕ22
where m m mmm is a constant and
( μ ϕ ) 2 = ( μ ϕ ) ( μ ϕ ) (40.9) = g μ ν ( μ ϕ ) ( ν ϕ ) μ ϕ 2 = μ ϕ μ ϕ (40.9) = g μ ν μ ϕ ν ϕ {:[(del_(mu)phi)^(2)=(del_(mu)phi)*(del_(mu)phi)],[(40.9)=g^(mu nu)(del_(mu)phi)(del_(nu)phi)]:}\begin{align*} \left(\partial_{\mu} \phi\right)^{2} & =\left(\partial_{\mu} \phi\right) \cdot\left(\partial_{\mu} \phi\right) \\ & =g^{\mu \nu}\left(\partial_{\mu} \phi\right)\left(\partial_{\nu} \phi\right) \tag{40.9} \end{align*}(μϕ)2=(μϕ)(μϕ)(40.9)=gμν(μϕ)(νϕ)
In Minkowski space, this yields
(40.10) L m = 1 2 ( ϕ t ) 2 1 2 ( ϕ ) 2 m 2 2 ϕ 2 (40.10) L m = 1 2 ϕ t 2 1 2 ( ϕ ) 2 m 2 2 ϕ 2 {:(40.10)L_(m)=(1)/(2)((del phi)/(del t))^(2)-(1)/(2)( vec(grad)phi)^(2)-(m^(2))/(2)*phi^(2):}\begin{equation*} \mathcal{L}_{\mathrm{m}}=\frac{1}{2}\left(\frac{\partial \phi}{\partial t}\right)^{2}-\frac{1}{2}(\vec{\nabla} \phi)^{2}-\frac{m^{2}}{2} \cdot \phi^{2} \tag{40.10} \end{equation*}(40.10)Lm=12(ϕt)212(ϕ)2m22ϕ2
This can be interpreted as the difference between a kinetic energy density T T TTT and a potential energy density V V VVV. The first term is the kinetic energy, proportional to the time derivative of the field. The second is the potential energy cost for a field changing in space. The third term represents the energy cost of the field existing in spacetime at all.
It is important to note that, if there is more than one field present, we must write the action for all of the fields in terms of the sum of their Lagrangians along with a Lagrangian for any interactions between the fields. So if we have two fields ϕ ϕ phi\phiϕ and ψ ψ psi\psiψ then the action is written
S = d 4 x L = d 4 x [ L m ( ϕ , ϕ , μ ) + L m ( ψ , ψ , μ ) + L int ( ϕ , ψ ) ] , S = d 4 x L = d 4 x L m ϕ , ϕ , μ + L m ( ψ , ψ , μ ) + L int  ( ϕ , ψ ) , S=intd^(4)xL=intd^(4)x[L_(m)(phi,phi_(,mu))+L_(m)(psi,psi,mu)+L_("int ")(phi,psi)],quadS=\int \mathrm{d}^{4} x \mathcal{L}=\int \mathrm{d}^{4} x\left[\mathcal{L}_{\mathrm{m}}\left(\phi, \phi_{, \mu}\right)+\mathcal{L}_{\mathrm{m}}(\psi, \psi, \mu)+\mathcal{L}_{\text {int }}(\phi, \psi)\right], \quadS=d4xL=d4x[Lm(ϕ,ϕ,μ)+Lm(ψ,ψ,μ)+Lint (ϕ,ψ)], (40.11) where L int L int  L_("int ")\mathcal{L}_{\text {int }}Lint  encodes the interaction between the fields. This can also be written as a sum of actions
(40.12) S = S [ ϕ ] + S [ ψ ] + S int [ ϕ , ψ ] (40.12) S = S [ ϕ ] + S [ ψ ] + S int [ ϕ , ψ ] {:(40.12)S=S[phi]+S[psi]+S_(int)[phi","psi]:}\begin{equation*} S=S[\phi]+S[\psi]+S_{\mathrm{int}}[\phi, \psi] \tag{40.12} \end{equation*}(40.12)S=S[ϕ]+S[ψ]+Sint[ϕ,ψ]
As for particles, the equations of motion of a field are found by extremizing the total action by varying the fields and setting the variation in the action to zero. Extremizing the action S S SSS amounts to the condition 6 6 ^(6){ }^{6}6 δ S = δ d 4 x L = d 4 x δ L = 0 δ S = δ d 4 x L = d 4 x δ L = 0 delta S=delta intd^(4)xL=intd^(4)x deltaL=0\delta S=\delta \int \mathrm{d}^{4} x \mathcal{L}=\int \mathrm{d}^{4} x \delta \mathcal{L}=0δS=δd4xL=d4xδL=0, where the variation in the Lagrangian can be calculated by using the well-known rule for differentials. 7 7 ^(7){ }^{7}7
Fig. 40.1 Displacements of a string. The displacement from equilibrium is ψ ( x , t ) ψ ( x , t ) psi(x,t)\psi(x, t)ψ(x,t) and the equation of motion can be derived by considering an element of the string of length d x d x dx\mathrm{d} xdx and mass ρ d x ρ d x rhodx\rho \mathrm{d} xρdx. The figure shows a short section in the middle of the string which is assumed middle of the string which is assumed
to be tethered at either end so that to be tethered
ψ ( 0 , t ) = ψ ( , t ) ψ ( 0 , t ) = ψ ( , t ) psi(0,t)=psi(ℓ,t)\psi(0, t)=\psi(\ell, t)ψ(0,t)=ψ(,t).
6 6 ^(6){ }^{6}6 This only works assuming that the integration measure doesn't change when we vary the matter field. This latter point is where the complication comes in gravitation, since we vary action with respect to the components of the metric field, which in turn changes the integration measure.
7 7 ^(7){ }^{7}7 For a function f ( x , y ) f ( x , y ) f(x,y)f(x, y)f(x,y) we write
δ f = ( f x ) y δ x + ( f y ) x δ y δ f = f x y δ x + f y x δ y delta f=((del f)/(del x))_(y)delta x+((del f)/(del y))_(x)delta y\delta f=\left(\frac{\partial f}{\partial x}\right)_{y} \delta x+\left(\frac{\partial f}{\partial y}\right)_{x} \delta yδf=(fx)yδx+(fy)xδy. (40.13)
8 8 ^(8){ }^{8}8 Recall that for a particle the EulerLagrange equation is
(40.18) L q = d d t ( L q ˙ ) (40.18) L q = d d t L q ˙ {:(40.18)(del L)/(del q)=(d)/((d)t)((del L)/(del(q^(˙)))):}\begin{equation*} \frac{\partial L}{\partial q}=\frac{\mathrm{d}}{\mathrm{~d} t}\left(\frac{\partial L}{\partial \dot{q}}\right) \tag{40.18} \end{equation*}(40.18)Lq=d dt(Lq˙)
Note that using the alternative notation in the field case, we have
(40.19) L m ϕ = μ L m ( μ ϕ ) (40.19) L m ϕ = μ L m μ ϕ {:(40.19)(delL_(m))/(del phi)=del_(mu)(delL_(m))/(del(del_(mu)phi)):}\begin{equation*} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi}=\partial_{\mu} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial\left(\partial_{\mu} \phi\right)} \tag{40.19} \end{equation*}(40.19)Lmϕ=μLm(μϕ)
9 9 ^(9){ }^{9}9 Note for later that a general wave equation can be written (in units where the wave speed is normalized to unity) as
(40.22) 2 ψ t 2 + 2 ψ = 0 (40.22) 2 ψ t 2 + 2 ψ = 0 {:(40.22)-(del^(2)psi)/(delt^(2))+ vec(grad)^(2)psi=0:}\begin{equation*} -\frac{\partial^{2} \psi}{\partial t^{2}}+\vec{\nabla}^{2} \psi=0 \tag{40.22} \end{equation*}(40.22)2ψt2+2ψ=0
or as 2 ψ = 0 2 ψ = 0 del^(2)psi=0\partial^{2} \psi=02ψ=0 for short. In Chapter 46, we shall see how excitations in the metric field obey this equation, giving us gravitational waves.

Example 40.4

We vary the action S S SSS by changing the fields within a (3+1)-dimensional region D D D\mathcal{D}D of flat spacetime, subject to the condition that the variation of the fields δ ϕ δ ϕ delta phi\delta \phiδϕ vanishes at the boundary of D D D\mathcal{D}D. The variation δ S δ S delta S\delta SδS yields
(40.14) δ S = d 4 x [ L m ϕ δ ϕ + L m ϕ , μ δ ( ϕ , μ ) ] = 0 (40.14) δ S = d 4 x L m ϕ δ ϕ + L m ϕ , μ δ ( ϕ , μ ) = 0 {:(40.14)delta S=intd^(4)x[(delL_(m))/(del phi)delta phi+(delL_(m))/(delphi_(,mu))delta(phi,mu)]=0:}\begin{equation*} \delta S=\int \mathrm{d}^{4} x\left[\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi} \delta \phi+\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi_{, \mu}} \delta(\phi, \mu)\right]=0 \tag{40.14} \end{equation*}(40.14)δS=d4x[Lmϕδϕ+Lmϕ,μδ(ϕ,μ)]=0
However, we note that δ ( ϕ , μ ) = ( δ ϕ ) , μ δ ( ϕ , μ ) = ( δ ϕ ) , μ delta(phi,mu)=(delta phi)_(,mu)\delta(\phi, \mu)=(\delta \phi)_{, \mu}δ(ϕ,μ)=(δϕ),μ, so the product rule says that this second term becomes,
(40.15) d 4 x [ ( L m ϕ , μ δ ϕ ) , μ ( L m ϕ , μ ) , μ δ ϕ ] (40.15) d 4 x L m ϕ , μ δ ϕ , μ L m ϕ , μ , μ δ ϕ {:(40.15)intd^(4)x[((delL_(m))/(delphi_(,mu))delta phi)_(,mu)-((delL_(m))/(del phi,mu))_(,mu)delta phi]:}\begin{equation*} \int \mathrm{d}^{4} x\left[\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi_{, \mu}} \delta \phi\right)_{, \mu}-\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi, \mu}\right)_{, \mu} \delta \phi\right] \tag{40.15} \end{equation*}(40.15)d4x[(Lmϕ,μδϕ),μ(Lmϕ,μ),μδϕ]
The first term in eqn 40.15 is a total derivative and can, using the divergence theorem, be rewritten as
(40.16) d 4 x ( L m ϕ , μ δ ϕ ) , μ = D d σ ( L m ϕ , μ δ ϕ ) (40.16) d 4 x L m ϕ , μ δ ϕ , μ = D d σ L m ϕ , μ δ ϕ {:(40.16)intd^(4)x((delL_(m))/(delphi_(,mu))delta phi)_(,mu)=int_(delD)dsigma((delL_(m))/(del phi,mu)*delta phi):}\begin{equation*} \int \mathrm{d}^{4} x\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi_{, \mu}} \delta \phi\right)_{, \mu}=\int_{\partial \mathcal{D}} \mathrm{d} \sigma\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi, \mu} \cdot \delta \phi\right) \tag{40.16} \end{equation*}(40.16)d4x(Lmϕ,μδϕ),μ=Ddσ(Lmϕ,μδϕ)
where d σ d σ dsigma\mathrm{d} \sigmadσ labels elements of the (three-dimensional) surface on the boundary D D delD\partial \mathcal{D}D of our region D D D\mathcal{D}D. This integral must be zero since δ ϕ δ ϕ delta phi\delta \phiδϕ vanishes at the boundary D D delD\partial \mathcal{D}D. Collecting the remaining terms from eqns 40.14 and 40.15 we see that
(40.17) d 4 x [ L m ϕ ( L m ϕ , μ ) , μ ] δ ϕ = 0 (40.17) d 4 x L m ϕ L m ϕ , μ , μ δ ϕ = 0 {:(40.17)intd^(4)x[(delL_(m))/(del phi)-((delL_(m))/(del phi,mu))_(,mu)]delta phi=0:}\begin{equation*} \int \mathrm{d}^{4} x\left[\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi}-\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi, \mu}\right)_{, \mu}\right] \delta \phi=0 \tag{40.17} \end{equation*}(40.17)d4x[Lmϕ(Lmϕ,μ),μ]δϕ=0
This integral is guaranteed to vanish if the integrand does, which yields the EulerLagrange equation for fields.
In Minkowski space, this leads to the Euler-Lagrange equations 8 8 ^(8){ }^{8}8
(40.20) L m ϕ = ( L m ϕ , μ ) , μ (40.20) L m ϕ = L m ϕ , μ , μ {:(40.20)(delL_(m))/(del phi)=((delL_(m))/(delphi_(,mu)))_(,mu):}\begin{equation*} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi}=\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi_{, \mu}}\right)_{, \mu} \tag{40.20} \end{equation*}(40.20)Lmϕ=(Lmϕ,μ),μ
This closely resembles the version for particles we've used previously.

Example 40.5

For the string in Example 40.2 we then have
0 = L m ψ x L m ( ψ / x ) t L m ( ψ / t ) (40.21) = 0 + T 2 ψ x 2 ρ 2 ψ t 2 0 = L m ψ x L m ( ψ / x ) t L m ( ψ / t ) (40.21) = 0 + T 2 ψ x 2 ρ 2 ψ t 2 {:[0=(delL_(m))/(del psi)-(del)/(del x)(delL_(m))/(del(del psi//del x))-(del)/(del t)(delL_(m))/(del(del psi//del t))],[(40.21)=0+T(del^(2)psi)/(delx^(2))-rho(del^(2)psi)/(delt^(2))]:}\begin{align*} 0 & =\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \psi}-\frac{\partial}{\partial x} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial(\partial \psi / \partial x)}-\frac{\partial}{\partial t} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial(\partial \psi / \partial t)} \\ & =0+\mathcal{T} \frac{\partial^{2} \psi}{\partial x^{2}}-\rho \frac{\partial^{2} \psi}{\partial t^{2}} \tag{40.21} \end{align*}0=LmψxLm(ψ/x)tLm(ψ/t)(40.21)=0+T2ψx2ρ2ψt2
and so the wave equation ( 2 ψ / x 2 ) = ( 1 / v 2 ) ( 2 ψ / t 2 ) 2 ψ / x 2 = 1 / v 2 2 ψ / t 2 (del^(2)psi//delx^(2))=(1//v^(2))(del^(2)psi//delt^(2))\left(\partial^{2} \psi / \partial x^{2}\right)=\left(1 / v^{2}\right)\left(\partial^{2} \psi / \partial t^{2}\right)(2ψ/x2)=(1/v2)(2ψ/t2) with v = T / ρ v = T / ρ v=sqrt(T//rho)v=\sqrt{\mathcal{T} / \rho}v=T/ρ emerges almost effortlessly. 9 9 ^(9){ }^{9}9 We say that the field ψ ψ psi\psiψ hosts wave-like excitations.

Example 40.6

For the free scalar field in Minkowski space from Example 40.3 we find L m ϕ = m 2 ϕ , L m ϕ , μ = μ ϕ L m ϕ = m 2 ϕ , L m ϕ , μ = μ ϕ (delL_(m))/(del phi)=-m^(2)phi,quad(delL_(m))/(delphi_(,mu))=-del^(mu)phi\frac{\partial \mathcal{L}_{m}}{\partial \phi}=-m^{2} \phi, \quad \frac{\partial \mathcal{L}_{m}}{\partial \phi_{, \mu}}=-\partial^{\mu} \phiLmϕ=m2ϕ,Lmϕ,μ=μϕ,
leading to the equation of motion
( 2 + m 2 ) ϕ = 0 2 + m 2 ϕ = 0 (-del^(2)+m^(2))phi=0\left(-\partial^{2}+m^{2}\right) \phi=0(2+m2)ϕ=0,
where 2 ϕ = μ μ = g μ ν μ ν = 2 ϕ t 2 + 2 ϕ x 2 + 2 ϕ y 2 + 2 ϕ x 2 2 ϕ = μ μ = g μ ν μ ν = 2 ϕ t 2 + 2 ϕ x 2 + 2 ϕ y 2 + 2 ϕ x 2 del^(2)phi=del_(mu)del^(mu)=g^(mu nu)del_(mu)del_(nu)=-(del^(2)phi)/(delt^(2))+(del^(2)phi)/(delx^(2))+(del^(2)phi)/(dely^(2))+(del^(2)phi)/(delx^(2))\partial^{2} \phi=\partial_{\mu} \partial^{\mu}=g^{\mu \nu} \partial_{\mu} \partial_{\nu}=-\frac{\partial^{2} \phi}{\partial t^{2}}+\frac{\partial^{2} \phi}{\partial x^{2}}+\frac{\partial^{2} \phi}{\partial y^{2}}+\frac{\partial^{2} \phi}{\partial x^{2}}2ϕ=μμ=gμνμν=2ϕt2+2ϕx2+2ϕy2+2ϕx2. The equation of motion is known as the Klein-Gordon equation.

40.3 Fields in curved spacetime

When spacetime is curved, the usual flat-space volume element d 4 x d 4 x d^(4)x\mathrm{d}^{4} xd4x is replaced by the invariant element 10 g d 4 x 10 g d 4 x ^(10)sqrt(-g)d^(4)x{ }^{10} \sqrt{-g} \mathrm{~d}^{4} x10g d4x. For a scalar matter field ϕ ϕ phi\phiϕ, the action functional is given by
(40.25) S [ ϕ ] = d 4 x g L m ( ϕ , ϕ x μ ) (40.25) S [ ϕ ] = d 4 x g L m ϕ , ϕ x μ {:(40.25)S[phi]=intd^(4)xsqrt(-g)L_(m)(phi,(del phi)/(delx^(mu))):}\begin{equation*} S[\phi]=\int \mathrm{d}^{4} x \sqrt{-g} \mathcal{L}_{\mathrm{m}}\left(\phi, \frac{\partial \phi}{\partial x^{\mu}}\right) \tag{40.25} \end{equation*}(40.25)S[ϕ]=d4xgLm(ϕ,ϕxμ)
When spacetime is curved the partial derivative μ μ del_(mu)\partial_{\mu}μ is upgraded to the covariant derivative μ μ grad_(mu)\boldsymbol{\nabla}_{\mu}μ. For a scalar we note that μ ϕ = μ ϕ μ ϕ = μ ϕ del_(mu)phi=grad_(mu)phi\partial_{\mu} \phi=\boldsymbol{\nabla}_{\mu} \phiμϕ=μϕ and so our previous equation for the action remains valid with L m ( ϕ , ϕ , μ ) L m ϕ , ϕ , μ L_(m)(phi,phi_(,mu))rarr\mathcal{L}_{\mathrm{m}}\left(\phi, \phi_{, \mu}\right) \rightarrowLm(ϕ,ϕ,μ) L m ( ϕ , ϕ ; μ ) L m ϕ , ϕ ; μ L_(m)(phi,phi_(;mu))\mathcal{L}_{\mathrm{m}}\left(\phi, \phi_{; \mu}\right)Lm(ϕ,ϕ;μ). We might guess that with the inclusion of the covariant derivative, the Euler-Lagrange equations for scalar fields should look like
(40.26) L m ϕ = μ L m ( μ ϕ ) (40.26) L m ϕ = μ L m μ ϕ {:(40.26)(delL_(m))/(del phi)=grad_(mu)(delL_(m))/(del(grad_(mu)phi)):}\begin{equation*} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi}=\nabla_{\mu} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial\left(\nabla_{\mu} \phi\right)} \tag{40.26} \end{equation*}(40.26)Lmϕ=μLm(μϕ)
It does.
When applied to vector or tensor fields, the Euler-Lagrange equation outputs the equations of motion for each component of a field. This is because the Lagrangian machinery works in terms of components. In generalizing the above equation, we therefore need a notation that picks out the components of the covariant derivative. This is the useful semicolon notation, 11 11 ^(11){ }^{11}11 which allows us to write the Euler-Lagrange equation for scalar fields as
(40.27) L m ϕ = ( L m ϕ ; μ ) ; μ (40.27) L m ϕ = L m ϕ ; μ ; μ {:(40.27)(delL_(m))/(del phi)=((delL_(m))/(delphi_(;mu)))_(;mu):}\begin{equation*} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi}=\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi_{; \mu}}\right)_{; \mu} \tag{40.27} \end{equation*}(40.27)Lmϕ=(Lmϕ;μ);μ
We therefore infer that the equations of motion for the components of a vector field A A A\boldsymbol{A}A are given by the Euler-Lagrange equation
(40.28) L m A ν = ( L m A ν ; μ ) ; μ (40.28) L m A ν = L m A ν ; μ ; μ {:(40.28)(delL_(m))/(delA^(nu))=((delL_(m))/(delA^(nu)_(;mu)))_(;mu):}\begin{equation*} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial A^{\nu}}=\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial A^{\nu}{ }_{; \mu}}\right)_{; \mu} \tag{40.28} \end{equation*}(40.28)LmAν=(LmAν;μ);μ
Dealing with the equations of motion of matter fields is fairly straightforward in curved spacetime. The complication compared to Minkowski space being that we must be careful to include the components of the metric field in our manipulations. 12 12 ^(12){ }^{12}12
Example 40.7
For the scalar field in curved spacetime we have the Lagrangian density
(40.29) L m = 1 2 ϕ ; μ ϕ ; ν g μ ν m 2 2 ϕ 2 (40.29) L m = 1 2 ϕ ; μ ϕ ; ν g μ ν m 2 2 ϕ 2 {:(40.29)L_(m)=-(1)/(2)phi_(;mu)phi_(;nu)g^(mu nu)-(m^(2))/(2)phi^(2):}\begin{equation*} \mathcal{L}_{\mathrm{m}}=-\frac{1}{2} \phi_{; \mu} \phi_{; \nu} g^{\mu \nu}-\frac{m^{2}}{2} \phi^{2} \tag{40.29} \end{equation*}(40.29)Lm=12ϕ;μϕ;νgμνm22ϕ2
We have L m / ϕ = m 2 ϕ L m / ϕ = m 2 ϕ delL_(m)//del phi=-m^(2)phi\partial \mathcal{L}_{\mathrm{m}} / \partial \phi=-m^{2} \phiLm/ϕ=m2ϕ. Differentiating L L L\mathcal{L}L with respect to μ ϕ = ϕ ; μ μ ϕ = ϕ ; μ grad_(mu)phi=phi_(;mu)\nabla_{\mu} \phi=\phi_{; \mu}μϕ=ϕ;μ gives us
(40.30) ( L m ϕ ; μ ) = g μ ν ϕ ; ν (40.30) L m ϕ ; μ = g μ ν ϕ ; ν {:(40.30)((delL_(m))/(delphi_(;mu)))=-g^(mu nu)phi_(;nu):}\begin{equation*} \left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi_{; \mu}}\right)=-g^{\mu \nu} \phi_{; \nu} \tag{40.30} \end{equation*}(40.30)(Lmϕ;μ)=gμνϕ;ν
from which we conclude 13 13 ^(13){ }^{13}13
(40.32) ( L m ϕ ; μ ) ; μ = g μ ν ϕ ; ν μ . (40.32) L m ϕ ; μ ; μ = g μ ν ϕ ; ν μ . {:(40.32)((delL_(m))/(delphi_(;mu)))_(;mu)=-g^(mu nu)phi_(;nu mu).:}\begin{equation*} \left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \phi_{; \mu}}\right)_{; \mu}=-g^{\mu \nu} \phi_{; \nu \mu} . \tag{40.32} \end{equation*}(40.32)(Lmϕ;μ);μ=gμνϕ;νμ.
10 10 ^(10){ }^{10}10 The volume 4 -form, written in the coordinate frame, is
ω ~ = g d t ( ) d x ( ) d y ( ) d z ( ) ω ~ = g d t ( ) d x ( ) d y ( ) d z ( ) tilde(omega)=sqrt(-g)dt()^^dx()^^dy()^^dz()\tilde{\boldsymbol{\omega}}=\sqrt{-g} \boldsymbol{d} t() \wedge \boldsymbol{d} x() \wedge \boldsymbol{d} y() \wedge \boldsymbol{d} z()ω~=gdt()dx()dy()dz().
Inserting Cartesian infinitesimal vectors d t e t , d x e x , d y e y d t e t , d x e x , d y e y dte_(t),dxe_(x),dye_(y)\mathrm{d} t \boldsymbol{e}_{t}, \mathrm{~d} x \boldsymbol{e}_{x}, \mathrm{~d} y \boldsymbol{e}_{y}dtet, dxex, dyey and d z e z d z e z dze_(z)\mathrm{d} z \boldsymbol{e}_{z}dzez, yields the volume element g d t d x d y d z g d t d x d y d z sqrt(-g)dtdxdydz\sqrt{-g} \mathrm{~d} t \mathrm{~d} x \mathrm{~d} y \mathrm{~d} zg dt dx dy dz.
11 11 ^(11){ }^{11}11 Remember A ν ; μ = μ A ν + Γ ν μ β A β A ν ; μ = μ A ν + Γ ν μ β A β A^(nu)_(;mu)=del_(mu)A^(nu)+Gamma^(nu)_(mu beta)A^(beta)A^{\nu}{ }_{; \mu}=\partial_{\mu} A^{\nu}+\Gamma^{\nu}{ }_{\mu \beta} A^{\beta}Aν;μ=μAν+ΓνμβAβ and A ν ; μ = μ A ν Γ β μ ν A β A ν ; μ = μ A ν Γ β μ ν A β A_(nu;mu)=del_(mu)A_(nu)-Gamma^(beta)_(mu nu)A_(beta)A_{\nu ; \mu}=\partial_{\mu} A_{\nu}-\Gamma^{\beta}{ }_{\mu \nu} A_{\beta}Aν;μ=μAνΓβμνAβ.
12 12 ^(12){ }^{12}12 Here it is helpful to remember that the partial derivative μ μ del_(mu)\partial_{\mu}μ is naturally lowered, which is to say that the components are in the down position. This is also true of the covariant derivative μ μ grad_(mu)\boldsymbol{\nabla}_{\mu}μ. We therefore make products of μ μ grad_(mu)\nabla_{\mu}μ. We therefore make products of
these derivatives using the all-up comthese derivatives using the all-up com-
ponents of the ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2) metric g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν. It is ponents of the ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2) metric g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν. It is
also worth noting (since we usually deal also worth noting (since we usually deal
with the all-down components of the with the all-down components of the
metric g g ggg ) that it is only in the case that the metric is diagonal that we have g μ ν = 1 / g μ ν g μ ν = 1 / g μ ν g_(mu nu)=1//g^(mu nu)g_{\mu \nu}=1 / g^{\mu \nu}gμν=1/gμν.
13 13 ^(13){ }^{13}13 The semicolon notation is especially useful here. Explicitly we have that
ϕ ; μ ν = ( ν ϕ ) ; μ (40.31) = μ ν ϕ Γ σ μ ν σ ϕ . ϕ ; μ ν = ν ϕ ; μ (40.31) = μ ν ϕ Γ σ μ ν σ ϕ . {:[phi_(;mu nu)=(del_(nu)phi)_(;mu)],[(40.31)=del_(mu)del_(nu)phi-Gamma^(sigma)_(mu nu)del_(sigma)phi.]:}\begin{align*} \phi_{; \mu \nu} & =\left(\partial_{\nu} \phi\right)_{; \mu} \\ & =\partial_{\mu} \partial_{\nu} \phi-\Gamma^{\sigma}{ }_{\mu \nu} \partial_{\sigma} \phi . \tag{40.31} \end{align*}ϕ;μν=(νϕ);μ(40.31)=μνϕΓσμνσϕ.
14 14 ^(14){ }^{14}14 The constant is used here to ensure that however we have defined the matter fields, when combined with the Einstein-Hilbert action they correctly produce the 8 π 8 π 8pi8 \pi8π term in the Einstein equation G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT.
15 15 ^(15){ }^{15}15 We state this without proof, but the inclusion of the Ricci scalar field R ( x ) R ( x ) R(x)R(x)R(x) is perhaps not completely unexpected By the principle of general covariance, we would be looking for a field which transforms properly, and we might ex pect something fairly simple and fundamental to appear there. As we shall see though, the proof of the pudding is that it produces the right answer.
The equation of motion is, therefore
(40.33) g μ ν ϕ ; ν μ m 2 ϕ = 0 . (40.33) g μ ν ϕ ; ν μ m 2 ϕ = 0 . {:(40.33)g^(mu nu)phi_(;nu mu)-m^(2)phi=0.:}\begin{equation*} g^{\mu \nu} \phi_{; \nu \mu}-m^{2} \phi=0 . \tag{40.33} \end{equation*}(40.33)gμνϕ;νμm2ϕ=0.
We therefore have a systematic means of dealing with matter fields in curved spacetime.

40.4 Motivating the Einstein equation

How do we deal with the geometry of spacetime, expressed via g ( x ) g ( x ) g(x)\boldsymbol{g}(x)g(x) in the Lagrangian formalism? This is the problem that was solved by David Hilbert, with the solution (now called) the Einstein-Hilbert action S EH S EH S_(EH)S_{\mathrm{EH}}SEH. The idea is that we need to add this new term S EH S EH  S_("EH ")S_{\text {EH }}SEH  that reflects the g ( x ) g ( x ) g(x)\boldsymbol{g}(x)g(x) field to the action S m S m S_(m)S_{\mathrm{m}}Sm of the system of matter fields. The total action is then
(40.34) S = S EH + α S m , (40.34) S = S EH + α S m , {:(40.34)S=S_(EH)+alphaS_(m)",":}\begin{equation*} S=S_{\mathrm{EH}}+\alpha S_{\mathrm{m}}, \tag{40.34} \end{equation*}(40.34)S=SEH+αSm,
where α α alpha\alphaα is a constant 14 14 ^(14){ }^{14}14 and S m S m S_(m)S_{\mathrm{m}}Sm contains the contributions from all of the matter fields in the Universe. The equation of motion (a.k.a. the Einstein equation) then results from the variation of the total action, found by setting δ S = 0 δ S = 0 delta S=0\delta S=0δS=0. The variation of the matter fields should be expected to supply the right-hand side of the Einstein equation reflecting the massenergy in the Universe, while the variation of the Einstein-Hilbert action should give us the geometrical left-hand side.
A complication here is that, unfortunately, we can't simply guess a total Lagrangian and feed this into the Euler-Lagrange equations to extract the Einstein equation. This is because the field that we must vary, subject to the condition that δ S = 0 δ S = 0 delta S=0\delta S=0δS=0, is the metric field g ( x ) g ( x ) g(x)\boldsymbol{g}(x)g(x). The variation therefore has the effect of changing the intervals between points in spacetime itself and, in particular, changing the volume element g d 4 x g d 4 x sqrt(-g)d^(4)x\sqrt{-g} \mathrm{~d}^{4} xg d4x in the action interval. We must therefore vary the total action with respect to the components of the field g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν of the metric from first principles, such that δ S δ S delta S\delta SδS vanishes. It is also important that we vary the action with respect to g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν and not the all-up components g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν. (This is because the gravitational field is most directly represented by the ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2) tensor field g ( g ( g(\boldsymbol{g}(g(, ) i n w h i c h w e i n p u t v e c t o r s . ) W e t h e r e f o r e s e e k ) i n w h i c h w e i n p u t v e c t o r s . ) W e t h e r e f o r e s e e k )inwhichweinputvectors.)Wethereforeseek) in which we input vectors.) We therefore seek)inwhichweinputvectors.)Wethereforeseek
(40.35) δ S = d 4 x ( g L ) g μ ν δ g μ ν = 0 (40.35) δ S = d 4 x ( g L ) g μ ν δ g μ ν = 0 {:(40.35)delta S=intd^(4)x(del(sqrt(-g)L))/(delg_(mu nu))deltag_(mu nu)=0:}\begin{equation*} \delta S=\int \mathrm{d}^{4} x \frac{\partial(\sqrt{-g} \mathcal{L})}{\partial g_{\mu \nu}} \delta g_{\mu \nu}=0 \tag{40.35} \end{equation*}(40.35)δS=d4x(gL)gμνδgμν=0
After all of these preliminaries, we are now ready to meet Hilbert's famous result. 15 15 ^(15){ }^{15}15 The Lagrangian density for geometry is given by the Ricci scalar field R ( x ) R ( x ) R(x)R(x)R(x), such that the Einstein-Hilbert action is written as
(40.36) S EH = d 4 x g ( x ) R ( x ) (40.36) S EH = d 4 x g ( x ) R ( x ) {:(40.36)S_(EH)=intd^(4)xsqrt(-g(x))R(x):}\begin{equation*} S_{\mathrm{EH}}=\int \mathrm{d}^{4} x \sqrt{-g(x)} R(x) \tag{40.36} \end{equation*}(40.36)SEH=d4xg(x)R(x)
This is added to the matter action α S m = d 4 x g ( x ) α L m α S m = d 4 x g ( x ) α L m alphaS_(m)=intd^(4)xsqrt(-g(x))alphaL_(m)\alpha S_{\mathrm{m}}=\int \mathrm{d}^{4} x \sqrt{-g(x)} \alpha \mathcal{L}_{\mathrm{m}}αSm=d4xg(x)αLm, where α α alpha\alphaα is the constant discussed above. Varying this total Lagrangian with
respect to the components of the metric results in the Einstein equations describing the dynamics that link matter fields and geometrical fields.
In order to vary the geometry Lagrangian in eqn 40.36, we shall need three new mathematical tricks: two are rules for the variation of the metric tensor and a third is for varying the Riemann tensor. The first comes from our need to vary the action with respect to the components g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν. It turns out that this forces us to also consider the result of varying components g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν. The rule we need is that there's an extra minus sign relating the variation of the up and down versions of the vector
(40.37) δ g σ ρ = g σ μ δ g μ ν g ν ρ (Rule 1). (40.37) δ g σ ρ = g σ μ δ g μ ν g ν ρ  (Rule 1).  {:(40.37)deltag^(sigma rho)=-g^(sigma mu)deltag_(mu nu)g^(nu rho)quad" (Rule 1). ":}\begin{equation*} \delta g^{\sigma \rho}=-g^{\sigma \mu} \delta g_{\mu \nu} g^{\nu \rho} \quad \text { (Rule 1). } \tag{40.37} \end{equation*}(40.37)δgσρ=gσμδgμνgνρ (Rule 1). 
We prove this below.
Example 40.8
The identity can be proven by considering the variation of the matrix equation M M 1 = I M _ M _ 1 = I M_M_^(-1)=I\underline{\boldsymbol{M}} \underline{\boldsymbol{M}}^{-1}=IMM1=I to give
(40.38) ( δ M ) M 1 + M δ M 1 = 0 (40.38) ( δ M _ ) M _ 1 + M _ δ M _ 1 = 0 {:(40.38)(deltaM_)M_^(-1)+M_deltaM_^(-1)=0:}\begin{equation*} (\delta \underline{M}) \underline{M}^{-1}+\underline{M} \delta \underline{M}^{-1}=0 \tag{40.38} \end{equation*}(40.38)(δM)M1+MδM1=0
From this we derive the matrix equation
(40.39) δ M 1 = M 1 ( δ M ) M 1 (40.39) δ M _ 1 = M _ 1 ( δ M _ ) M _ 1 {:(40.39)deltaM_^(-1)=-M_^(-1)(deltaM_)M_^(-1):}\begin{equation*} \delta \underline{\boldsymbol{M}}^{-1}=-\underline{\boldsymbol{M}}^{-1}(\delta \underline{M}) \underline{\boldsymbol{M}}^{-1} \tag{40.39} \end{equation*}(40.39)δM1=M1(δM)M1
Note the important minus sign! Representing the components of the metric as a matrix with M μ ν = g μ ν M _ μ ν = g μ ν M__(mu nu)=g_(mu nu)\underline{\boldsymbol{M}}_{\mu \nu}=g_{\mu \nu}Mμν=gμν and ( M 1 ) μ ν = g μ ν M _ 1 μ ν = g μ ν (M_^(-1))_(mu nu)=g^(mu nu)\left(\underline{\boldsymbol{M}}^{-1}\right)_{\mu \nu}=g^{\mu \nu}(M1)μν=gμν, we must then have
(40.40) δ g α δ = g α β δ g β γ g γ δ (40.40) δ g α δ = g α β δ g β γ g γ δ {:(40.40)deltag^(alpha delta)=-g^(alpha beta)deltag_(beta gamma)g^(gamma delta):}\begin{equation*} \delta g^{\alpha \delta}=-g^{\alpha \beta} \delta g_{\beta \gamma} g^{\gamma \delta} \tag{40.40} \end{equation*}(40.40)δgαδ=gαβδgβγgγδ
Conclusion: There's an extra minus sign when you vary the up-index version of the metric.
The second rule we need is for the variation of a determinant
(40.41) δ g = 1 2 g g μ ν δ g μ ν ( Rule 2 ) (40.41) δ g = 1 2 g g μ ν δ g μ ν (  Rule  2 ) {:(40.41)deltasqrt(-g)=(1)/(2)sqrt(-g)g^(mu nu)deltag_(mu nu)quad(" Rule "2):}\begin{equation*} \delta \sqrt{-g}=\frac{1}{2} \sqrt{-g} g^{\mu \nu} \delta g_{\mu \nu} \quad(\text { Rule } 2) \tag{40.41} \end{equation*}(40.41)δg=12ggμνδgμν( Rule 2)

Example 40.9

To prove this we use the identity 16 ln det M = Tr ln M 16 ln det M _ = Tr ln M _ ^(16)ln detM_=Tr ln M_{ }^{16} \ln \operatorname{det} \underline{M}=\operatorname{Tr} \ln \underline{M}16lndetM=TrlnM. Differentiate the identity to find
δ det M = det M δ ( Tr ln M ) (40.42) = det M Tr ( M 1 δ M ) δ det M _ = det M _ δ ( Tr ln M _ ) (40.42) = det M _ Tr M _ 1 δ M _ {:[delta detM_=detM_delta(Tr ln M_)],[(40.42)=detM_Tr(M_^(-1)deltaM_)]:}\begin{align*} \delta \operatorname{det} \underline{\boldsymbol{M}} & =\operatorname{det} \underline{\boldsymbol{M}} \delta(\operatorname{Tr} \ln \underline{\boldsymbol{M}}) \\ & =\operatorname{det} \underline{\boldsymbol{M}} \operatorname{Tr}\left(\underline{\boldsymbol{M}}^{-1} \delta \underline{\boldsymbol{M}}\right) \tag{40.42} \end{align*}δdetM=detMδ(TrlnM)(40.42)=detMTr(M1δM)
Plugging in the metric leads to the equation claimed above.
The third and final rule we shall need is the Palatini identity 17 17 ^(17){ }^{17}17 which says that the variation of components of the Ricci tensor R α β R α β R_(alpha beta)R_{\alpha \beta}Rαβ can be written in terms of covariant derivatives of the variation of the connection coefficients. The identity reads
(40.43) δ R α β = μ ( δ Γ μ α β ) β ( δ Γ μ α μ ) (Rule 3). (40.43) δ R α β = μ δ Γ μ α β β δ Γ μ α μ  (Rule 3).  {:(40.43)deltaR_(alpha beta)=grad_(mu)(deltaGamma^(mu)_(alpha beta))-grad_(beta)(deltaGamma^(mu)_(alpha mu))quad" (Rule 3). ":}\begin{equation*} \delta R_{\alpha \beta}=\nabla_{\mu}\left(\delta \Gamma^{\mu}{ }_{\alpha \beta}\right)-\nabla_{\beta}\left(\delta \Gamma^{\mu}{ }_{\alpha \mu}\right) \quad \text { (Rule 3). } \tag{40.43} \end{equation*}(40.43)δRαβ=μ(δΓμαβ)β(δΓμαμ) (Rule 3). 
16 16 ^(16){ }^{16}16 The proof of this identity, which can be found in many sources, is left as an exercise. Alternatively, one can follow Exercise 34.3.
17 17 ^(17){ }^{17}17 Attilio Palatini (1889-1949) showed that the variations of the connection coefficients are the coordinate components of a tensor. We won't reproduce the derivation here, but note that the Palatini identity in eqn 40.43 is proved in Zee (2013).
With these three identities under our belts, we can vary the EinsteinHilbert action. There's nothing for it now but to plunge into the algebra.

Example 40.10

18 18 ^(18){ }^{18}18 That is, we start from R = g α β R α β R = g α β R α β R=g^(alpha beta)R_(alpha beta)R=g^{\alpha \beta} R_{\alpha \beta}R=gαβRαβ. Written out in terms of components of the Ricci tensor, 18 18 ^(18){ }^{18}18 we have that the EinsteinHilbert action comprises three objects
(40.44) S EH = d 4 x g g σ ρ R σ ρ . (40.44) S EH = d 4 x g g σ ρ R σ ρ . {:(40.44)S_(EH)=intd^(4)xsqrt(-g)g^(sigma rho)R_(sigma rho).:}\begin{equation*} S_{\mathrm{EH}}=\int \mathrm{d}^{4} x \sqrt{-g} g^{\sigma \rho} R_{\sigma \rho} . \tag{40.44} \end{equation*}(40.44)SEH=d4xggσρRσρ.
Varying the action gets us three terms, one for each object
δ S = d 4 x [ g g σ ρ δ R σ ρ + g δ g σ ρ R σ ρ + δ ( g ) R ] (40.45) = δ I 1 + δ I 2 + δ I 3 = 0 . δ S = d 4 x g g σ ρ δ R σ ρ + g δ g σ ρ R σ ρ + δ ( g ) R (40.45) = δ I 1 + δ I 2 + δ I 3 = 0 . {:[delta S=intd^(4)x[sqrt(-g)g^(sigma rho)deltaR_(sigma rho)+sqrt(-g)deltag^(sigma rho)R_(sigma rho)+delta(sqrt(-g))R]],[(40.45)=deltaI_(1)+deltaI_(2)+deltaI_(3)=0.]:}\begin{align*} \delta S & =\int \mathrm{d}^{4} x\left[\sqrt{-g} g^{\sigma \rho} \delta R_{\sigma \rho}+\sqrt{-g} \delta g^{\sigma \rho} R_{\sigma \rho}+\delta(\sqrt{-g}) R\right] \\ & =\delta I_{1}+\delta I_{2}+\delta I_{3}=0 . \tag{40.45} \end{align*}δS=d4x[ggσρδRσρ+gδgσρRσρ+δ(g)R](40.45)=δI1+δI2+δI3=0.
We deal with these in reverse order. For δ I 3 δ I 3 deltaI_(3)\delta I_{3}δI3 we need Rule 2 from eqn 40.37, which yields
δ I 3 = d 4 x δ ( g ) R = 1 2 d 4 x g g μ ν δ g μ ν R (40.46) = d 4 x g ( 1 2 g μ ν R ) δ g μ ν δ I 3 = d 4 x δ ( g ) R = 1 2 d 4 x g g μ ν δ g μ ν R (40.46) = d 4 x g 1 2 g μ ν R δ g μ ν {:[deltaI_(3)=intd^(4)x delta(sqrt(-g))R],[=(1)/(2)intd^(4)xsqrt(-g)g^(mu nu)deltag_(mu nu)R],[(40.46)=intd^(4)xsqrt(-g)((1)/(2)g^(mu nu)R)deltag_(mu nu)]:}\begin{align*} \delta I_{3} & =\int \mathrm{d}^{4} x \delta(\sqrt{-g}) R \\ & =\frac{1}{2} \int \mathrm{~d}^{4} x \sqrt{-g} g^{\mu \nu} \delta g_{\mu \nu} R \\ & =\int \mathrm{d}^{4} x \sqrt{-g}\left(\frac{1}{2} g^{\mu \nu} R\right) \delta g_{\mu \nu} \tag{40.46} \end{align*}δI3=d4xδ(g)R=12 d4xggμνδgμνR(40.46)=d4xg(12gμνR)δgμν
For δ I 2 δ I 2 deltaI_(2)\delta I_{2}δI2 we need Rule 1 (eqn 40.41) for lowering the indices of the components of the metric. Applying this, we obtain
δ I 2 = d 4 x g R σ ρ ( g σ μ δ g μ ν g ν ρ ) (40.47) = d 4 x g R μ ν δ g μ ν . δ I 2 = d 4 x g R σ ρ g σ μ δ g μ ν g ν ρ (40.47) = d 4 x g R μ ν δ g μ ν . {:[deltaI_(2)=intd^(4)xsqrt(-g)R_(sigma rho)(-g^(sigma mu)deltag_(mu nu)g^(nu rho))],[(40.47)=-intd^(4)xsqrt(-g)R^(mu nu)deltag_(mu nu).]:}\begin{align*} \delta I_{2} & =\int \mathrm{d}^{4} x \sqrt{-g} R_{\sigma \rho}\left(-g^{\sigma \mu} \delta g_{\mu \nu} g^{\nu \rho}\right) \\ & =-\int \mathrm{d}^{4} x \sqrt{-g} R^{\mu \nu} \delta g_{\mu \nu} . \tag{40.47} \end{align*}δI2=d4xgRσρ(gσμδgμνgνρ)(40.47)=d4xgRμνδgμν.
The result in terms of δ g μ ν δ g μ ν deltag_(mu nu)\delta g_{\mu \nu}δgμν and R μ ν R μ ν R^(mu nu)R^{\mu \nu}Rμν confirms that writing δ I 2 δ I 2 deltaI_(2)\delta I_{2}δI2 in terms of R σ ρ R σ ρ R_(sigma rho)R_{\sigma \rho}Rσρ and not R σ ρ R σ ρ R^(sigma rho)R^{\sigma \rho}Rσρ was the correct choice: the latter would not have had indices consistent with the term from δ I 3 δ I 3 deltaI_(3)\delta I_{3}δI3.
The most problematical part is the final integral for δ I 1 δ I 1 deltaI_(1)\delta I_{1}δI1. Fortunately, this only contributes a surface term which vanishes. The key to seeing this is to use the Palatini identity (Rule 3) which says
Integrating this
(40.48) δ R ν β = δ R ν μ β μ = μ δ Γ ν β μ β δ Γ μ ν μ . (40.48) δ R ν β = δ R ν μ β μ = μ δ Γ ν β μ β δ Γ μ ν μ . {:(40.48)deltaR_(nu beta)=deltaR_(nu mu beta)^(mu)=grad_(mu)deltaGamma_(nu beta)^(mu)-grad_(beta)deltaGamma^(mu)_(nu mu).:}\begin{equation*} \delta R_{\nu \beta}=\delta R_{\nu \mu \beta}^{\mu}=\nabla_{\mu} \delta \Gamma_{\nu \beta}^{\mu}-\nabla_{\beta} \delta \Gamma^{\mu}{ }_{\nu \mu} . \tag{40.48} \end{equation*}(40.48)δRνβ=δRνμβμ=μδΓνβμβδΓμνμ.
δ I 2 = d 4 x g g ν β ( μ δ Γ ν β μ β δ Γ μ ν μ ) (40.49) = d 4 x g β ( g ν β δ Γ μ ν μ g ν μ δ Γ β ν μ ) δ I 2 = d 4 x g g ν β μ δ Γ ν β μ β δ Γ μ ν μ (40.49) = d 4 x g β g ν β δ Γ μ ν μ g ν μ δ Γ β ν μ {:[deltaI_(2)=intd^(4)xsqrt(-g)g^(nu beta)(grad_(mu)deltaGamma_(nu beta)^(mu)-grad_(beta)deltaGamma^(mu)_(nu mu))],[(40.49)=-intd^(4)xsqrt(-g)grad_(beta)(g^(nu beta)deltaGamma^(mu)_(nu mu)-g^(nu mu)deltaGamma^(beta)_(nu mu))]:}\begin{align*} \delta I_{2} & =\int \mathrm{d}^{4} x \sqrt{-g} g^{\nu \beta}\left(\nabla_{\mu} \delta \Gamma_{\nu \beta}^{\mu}-\nabla_{\beta} \delta \Gamma^{\mu}{ }_{\nu \mu}\right) \\ & =-\int \mathrm{d}^{4} x \sqrt{-g} \nabla_{\beta}\left(g^{\nu \beta} \delta \Gamma^{\mu}{ }_{\nu \mu}-g^{\nu \mu} \delta \Gamma^{\beta}{ }_{\nu \mu}\right) \tag{40.49} \end{align*}δI2=d4xggνβ(μδΓνβμβδΓμνμ)(40.49)=d4xgβ(gνβδΓμνμgνμδΓβνμ)
using α g μ ν = 0 α g μ ν = 0 grad_(alpha)g^(mu nu)=0\nabla_{\alpha} g^{\mu \nu}=0αgμν=0. Finally, we use the divergence theorem to say that the derivative may be written as a surface integral over the boundary. This vanishes because the variation vanishes on the boundary by definition and so this term does not contribute. The result is that the contribution to the equations of motion is
(40.50) δ I 2 + δ I 3 = d 4 x g ( R μ ν 1 2 g μ ν R ) δ g μ ν (40.50) δ I 2 + δ I 3 = d 4 x g R μ ν 1 2 g μ ν R δ g μ ν {:(40.50)deltaI_(2)+deltaI_(3)=-intd^(4)xsqrt(-g)(R^(mu nu)-(1)/(2)g^(mu nu)R)deltag_(mu nu):}\begin{equation*} \delta I_{2}+\delta I_{3}=-\int \mathrm{d}^{4} x \sqrt{-g}\left(R^{\mu \nu}-\frac{1}{2} g^{\mu \nu} R\right) \delta g_{\mu \nu} \tag{40.50} \end{equation*}(40.50)δI2+δI3=d4xg(Rμν12gμνR)δgμν
The variation of the action should vanish, so we must have that δ S = δ I 2 + δ I 3 = 0 δ S = δ I 2 + δ I 3 = 0 delta S=deltaI_(2)+deltaI_(3)=0\delta S=\delta I_{2}+\delta I_{3}=0δS=δI2+δI3=0. The way to guarantee this is that the integrand must vanish.
The conclusion of this lengthy set of manipulations is that the components of the Einstein tensor G μ ν = ( R μ ν 1 2 g μ ν R ) G μ ν = R μ ν 1 2 g μ ν R G^(mu nu)=(R^(mu nu)-(1)/(2)g^(mu nu)R)G^{\mu \nu}=\left(R^{\mu \nu}-\frac{1}{2} g^{\mu \nu} R\right)Gμν=(Rμν12gμνR) emerge as the contribution to the (Einstein) equation of motion and that they must vanish. This gives the Einstein equation in the slightly stripped-down form R μ ν 1 2 g μ ν R = 0 R μ ν 1 2 g μ ν R = 0 R^(mu nu)-(1)/(2)g^(mu nu)R=0R^{\mu \nu}-\frac{1}{2} g^{\mu \nu} R=0Rμν12gμνR=0. This simplified form arises because we have not yet included the matter fields in the action. The matter fields must therefore spit out the energy-momentum tensor, and so we now turn to examine how the energy-momentum tensor can be derived from the action.

40.5 Energy-momentum tensor

Often the energy-momentum tensor is determined ad hoc, using all of the physical insight we can muster. However, field theory tells us that there should be a way to simply and mechanically extract it from the Lagrangian. There is, as we see in the next example.
Example 40.11
We can gain access to the energy-momentum tensor if we vary the action with respect to the components of the metric. The action is given by
(40.51) S = S EH + α S m = d 4 x g ( L EH + α L m ) (40.51) S = S EH + α S m = d 4 x g L EH + α L m {:(40.51)S=S_(EH)+alphaS_(m)=intd^(4)xsqrt(-g)(L_(EH)+alphaL_(m)):}\begin{equation*} S=S_{\mathrm{EH}}+\alpha S_{\mathrm{m}}=\int \mathrm{d}^{4} x \sqrt{-g}\left(\mathcal{L}_{\mathrm{EH}}+\alpha \mathcal{L}_{\mathrm{m}}\right) \tag{40.51} \end{equation*}(40.51)S=SEH+αSm=d4xg(LEH+αLm)
If we vary with respect to the components of the metric, we obtain
(40.52) d 4 x [ ( g L EH ) g μ ν + α ( g L m ) g μ ν ] δ g μ ν = 0 (40.52) d 4 x g L EH g μ ν + α g L m g μ ν δ g μ ν = 0 {:(40.52)intd^(4)x[(del(sqrt(-g)L_(EH)))/(delg_(mu nu))+alpha(del(sqrt(-g)L_(m)))/(delg_(mu nu))]deltag_(mu nu)=0:}\begin{equation*} \int \mathrm{d}^{4} x\left[\frac{\partial\left(\sqrt{-g} \mathcal{L}_{\mathrm{EH}}\right)}{\partial g_{\mu \nu}}+\alpha \frac{\partial\left(\sqrt{-g} \mathcal{L}_{\mathrm{m}}\right)}{\partial g_{\mu \nu}}\right] \delta g_{\mu \nu}=0 \tag{40.52} \end{equation*}(40.52)d4x[(gLEH)gμν+α(gLm)gμν]δgμν=0
We have from the previous section that
(40.53) d 4 x ( g L EH ) g μ ν δ g μ ν = d 4 x g G μ ν δ g μ ν (40.53) d 4 x g L EH g μ ν δ g μ ν = d 4 x g G μ ν δ g μ ν {:(40.53)intd^(4)x(del(sqrt(-g)L_(EH)))/(delg_(mu nu))deltag_(mu nu)=intd^(4)xsqrt(-g)G^(mu nu)deltag_(mu nu):}\begin{equation*} \int \mathrm{d}^{4} x \frac{\partial\left(\sqrt{-g} \mathcal{L}_{\mathrm{EH}}\right)}{\partial g_{\mu \nu}} \delta g_{\mu \nu}=\int \mathrm{d}^{4} x \sqrt{-g} G^{\mu \nu} \delta g_{\mu \nu} \tag{40.53} \end{equation*}(40.53)d4x(gLEH)gμνδgμν=d4xgGμνδgμν
where we have used the components of the Einstein tensor G μ ν = R μ ν 1 2 g μ ν R G μ ν = R μ ν 1 2 g μ ν R G^(mu nu)=R^(mu nu)-(1)/(2)g^(mu nu)RG^{\mu \nu}=R^{\mu \nu}-\frac{1}{2} g^{\mu \nu} RGμν=Rμν12gμνR.
Comparing eqn 40.52 with the Einstein equation, we make the suggestion
(40.54) T μ ν = 2 g δ ( g L m ) δ g μ ν (40.54) T μ ν = 2 g δ g L m δ g μ ν {:(40.54)T^(mu nu)=(2)/(sqrt(-g))(delta(sqrt(-g)L_(m)))/(deltag_(mu nu)):}\begin{equation*} T^{\mu \nu}=\frac{2}{\sqrt{-g}} \frac{\delta\left(\sqrt{-g} \mathcal{L}_{\mathrm{m}}\right)}{\delta g_{\mu \nu}} \tag{40.54} \end{equation*}(40.54)Tμν=2gδ(gLm)δgμν
We immediately see that this is a good choice, as equating the integrand of eqn 40.52 to zero leads to an expression
(40.55) g G μ ν α g T μ ν 2 = 0 . (40.55) g G μ ν α g T μ ν 2 = 0 . {:(40.55)sqrt(-g)G^(mu nu)-alpha(sqrt(-g)T^(mu nu))/(2)=0.:}\begin{equation*} \sqrt{-g} G^{\mu \nu}-\alpha \frac{\sqrt{-g} T^{\mu \nu}}{2}=0 . \tag{40.55} \end{equation*}(40.55)gGμναgTμν2=0.
We choose α = 16 π α = 16 π alpha=16 pi\alpha=16 \piα=16π, which leads to the Einstein equation G μ ν = 8 π T μ ν G μ ν = 8 π T μ ν G_(mu nu)=8piT_(mu nu)G_{\mu \nu}=8 \pi T_{\mu \nu}Gμν=8πTμν, which is what the variation must produce. We conclude that the action
(40.56) S = d 4 x g [ R ( x ) + 16 π L m ] (40.56) S = d 4 x g R ( x ) + 16 π L m {:(40.56)S=intd^(4)xsqrt(-g)[R(x)+16 piL_(m)]:}\begin{equation*} S=\int \mathrm{d}^{4} x \sqrt{-g}\left[R(x)+16 \pi \mathcal{L}_{\mathrm{m}}\right] \tag{40.56} \end{equation*}(40.56)S=d4xg[R(x)+16πLm]
when varied with respect to the components of the metric, leads to the Einstein equation.
Expanding eqn (40.54), we have a final prescription for calculating the energy-momentum tensor from the matter action:
(40.57) T μ ν = 2 δ L m δ g μ ν + g μ ν L m (40.57) T μ ν = 2 δ L m δ g μ ν + g μ ν L m {:(40.57)T^(mu nu)=2(deltaL_(m))/(deltag_(mu nu))+g^(mu nu)L_(m):}\begin{equation*} T^{\mu \nu}=2 \frac{\delta \mathcal{L}_{\mathrm{m}}}{\delta g_{\mu \nu}}+g^{\mu \nu} \mathcal{L}_{\mathrm{m}} \tag{40.57} \end{equation*}(40.57)Tμν=2δLmδgμν+gμνLm
We use this below. However, because the derivative operator μ μ del_(mu)\partial_{\mu}μ is naturally an object with a down index, the action for matter fields is most simply written in terms of a derivative with respect to g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν. It's then easiest to use an analogous expression for the all-down components of T T T\boldsymbol{T}T, which is 19 19 ^(19){ }^{19}19
(40.59) T μ ν = 2 L m g μ ν + g μ ν L m (40.59) T μ ν = 2 L m g μ ν + g μ ν L m {:(40.59)T_(mu nu)=-2(delL_(m))/(delg^(mu nu))+g_(mu nu)L_(m):}\begin{equation*} T_{\mu \nu}=-2 \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial g^{\mu \nu}}+g_{\mu \nu} \mathcal{L}_{\mathrm{m}} \tag{40.59} \end{equation*}(40.59)Tμν=2Lmgμν+gμνLm
Let's see some examples.
19 19 ^(19){ }^{19}19 To prove this, use the chain rule to say
T μ ν = 2 L m g α β g α β g μ ν + g μ ν L m = 2 g α μ g ν β L m g α β + g μ ν L m T μ ν = 2 L m g α β g α β g μ ν + g μ ν L m = 2 g α μ g ν β L m g α β + g μ ν L m {:[T^(mu nu)=2(delL_(m))/(delg^(alpha beta))(delg^(alpha beta))/(delg_(mu nu))+g^(mu nu)L_(m)],[=-2g^(alpha mu)g^(nu beta)(delL_(m))/(delg^(alpha beta))+g^(mu nu)L_(m)]:}\begin{aligned} T^{\mu \nu} & =2 \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial g^{\alpha \beta}} \frac{\partial g^{\alpha \beta}}{\partial g_{\mu \nu}}+g^{\mu \nu} \mathcal{L}_{\mathrm{m}} \\ & =-2 g^{\alpha \mu} g^{\nu \beta} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial g^{\alpha \beta}}+g^{\mu \nu} \mathcal{L}_{\mathrm{m}} \end{aligned}Tμν=2Lmgαβgαβgμν+gμνLm=2gαμgνβLmgαβ+gμνLm
Then lower indices in the usual way
T γ σ = T μ ν g μ γ g ν σ = 2 g α μ g ν β g μ γ g ν σ L m g α β + g μ ν g μ γ g ν σ L m = 2 δ α γ δ β σ L m g α β (40.58) + g γ σ L m , T γ σ = T μ ν g μ γ g ν σ = 2 g α μ g ν β g μ γ g ν σ L m g α β + g μ ν g μ γ g ν σ L m = 2 δ α γ δ β σ L m g α β (40.58) + g γ σ L m , {:[T_(gamma sigma)=T^(mu nu)g_(mu gamma)g_(nu sigma)],[=-2g^(alpha mu)g^(nu beta)g_(mu gamma)g_(nu sigma)(delL_(m))/(delg^(alpha beta))],[+g^(mu nu)g_(mu gamma)g_(nu sigma)L_(m)],[=-2delta^(alpha)_(gamma)delta^(beta)_(sigma)(delL_(m))/(delg^(alpha beta))],[(40.58)+g_(gamma sigma)L_(m)","]:}\begin{align*} T_{\gamma \sigma}= & T^{\mu \nu} g_{\mu \gamma} g_{\nu \sigma} \\ = & -2 g^{\alpha \mu} g^{\nu \beta} g_{\mu \gamma} g_{\nu \sigma} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial g^{\alpha \beta}} \\ & +g^{\mu \nu} g_{\mu \gamma} g_{\nu \sigma} \mathcal{L}_{\mathrm{m}} \\ = & -2 \delta^{\alpha}{ }_{\gamma} \delta^{\beta}{ }_{\sigma} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial g^{\alpha \beta}} \\ & +g_{\gamma \sigma} \mathcal{L}_{\mathrm{m}}, \tag{40.58} \end{align*}Tγσ=Tμνgμγgνσ=2gαμgνβgμγgνσLmgαβ+gμνgμγgνσLm=2δαγδβσLmgαβ(40.58)+gγσLm,
where we contracted on μ μ mu\muμ and ν ν nu\nuν in the final step.
\curvearrowright Electromagnetism is discussed in detail in Chapter 42.
Example 40.12
The scalar field Lagrangian is given by
(40.60) L m = 1 2 g μ ν ϕ ; μ ϕ ; ν m 2 2 ϕ 2 (40.60) L m = 1 2 g μ ν ϕ ; μ ϕ ; ν m 2 2 ϕ 2 {:(40.60)L_(m)=-(1)/(2)g^(mu nu)phi_(;mu)phi_(;nu)-(m^(2))/(2)phi^(2):}\begin{equation*} \mathcal{L}_{\mathrm{m}}=-\frac{1}{2} g^{\mu \nu} \phi_{; \mu} \phi_{; \nu}-\frac{m^{2}}{2} \phi^{2} \tag{40.60} \end{equation*}(40.60)Lm=12gμνϕ;μϕ;νm22ϕ2
We have
(40.61) L g μ ν = 1 2 ϕ ; μ ϕ ; ν (40.61) L g μ ν = 1 2 ϕ ; μ ϕ ; ν {:(40.61)(delL)/(delg^(mu nu))=-(1)/(2)phi_(;mu)phi_(;nu):}\begin{equation*} \frac{\partial \mathcal{L}}{\partial g^{\mu \nu}}=-\frac{1}{2} \phi_{; \mu} \phi_{; \nu} \tag{40.61} \end{equation*}(40.61)Lgμν=12ϕ;μϕ;ν
leading to the energy-momentum tensor
(40.62) T μ ν = ϕ ; μ ϕ ; ν g μ ν ( 1 2 g α β ϕ ; α ϕ ; β + m 2 2 ϕ 2 ) (40.62) T μ ν = ϕ ; μ ϕ ; ν g μ ν 1 2 g α β ϕ ; α ϕ ; β + m 2 2 ϕ 2 {:(40.62)T_(mu nu)=phi_(;mu)phi_(;nu)-g_(mu nu)((1)/(2)g^(alpha beta)phi_(;alpha)phi_(;beta)+(m^(2))/(2)phi^(2)):}\begin{equation*} T_{\mu \nu}=\phi_{; \mu} \phi_{; \nu}-g_{\mu \nu}\left(\frac{1}{2} g^{\alpha \beta} \phi_{; \alpha} \phi_{; \beta}+\frac{m^{2}}{2} \phi^{2}\right) \tag{40.62} \end{equation*}(40.62)Tμν=ϕ;μϕ;νgμν(12gαβϕ;αϕ;β+m22ϕ2)
In flat space, this becomes
(40.63) T μ ν = μ ϕ ν ϕ 1 2 η μ ν ( μ ϕ ) 2 η μ ν m 2 2 ϕ 2 (40.63) T μ ν = μ ϕ ν ϕ 1 2 η μ ν μ ϕ 2 η μ ν m 2 2 ϕ 2 {:(40.63)T_(mu nu)=del_(mu)phidel_(nu)phi-(1)/(2)eta_(mu nu)(del_(mu)phi)^(2)-eta_(mu nu)(m^(2))/(2)phi^(2):}\begin{equation*} T_{\mu \nu}=\partial_{\mu} \phi \partial_{\nu} \phi-\frac{1}{2} \eta_{\mu \nu}\left(\partial_{\mu} \phi\right)^{2}-\eta_{\mu \nu} \frac{m^{2}}{2} \phi^{2} \tag{40.63} \end{equation*}(40.63)Tμν=μϕνϕ12ημν(μϕ)2ημνm22ϕ2
which has timelike component T 00 = 1 2 ( 0 ϕ ) 2 + 1 2 ( ϕ ) 2 + m 2 2 ϕ 2 T 00 = 1 2 0 ϕ 2 + 1 2 ( ϕ ) 2 + m 2 2 ϕ 2 T_(00)=(1)/(2)(del_(0)phi)^(2)+(1)/(2)( vec(grad)phi)^(2)+(m^(2))/(2)*phi^(2)T_{00}=\frac{1}{2}\left(\partial_{0} \phi\right)^{2}+\frac{1}{2}(\vec{\nabla} \phi)^{2}+\frac{m^{2}}{2} \cdot \phi^{2}T00=12(0ϕ)2+12(ϕ)2+m22ϕ2, which simply reflects the sum of kinetic energy and potential energy.
Electromagnetism is described by the Lagrangian
(40.64) L m = 1 4 F α β F γ δ g α γ g β δ . (40.65) L g α γ = 1 2 F α β F γ δ g β δ . (40.64) L m = 1 4 F α β F γ δ g α γ g β δ . (40.65) L g α γ = 1 2 F α β F γ δ g β δ . {:[(40.64)L_(m)=-(1)/(4)F_(alpha beta)F_(gamma delta)g^(alpha gamma)g^(beta delta).],[(40.65)(delL)/(delg^(alpha gamma))=-(1)/(2)F_(alpha beta)F_(gamma delta)g^(beta delta).]:}\begin{gather*} \mathcal{L}_{\mathrm{m}}=-\frac{1}{4} F_{\alpha \beta} F_{\gamma \delta} g^{\alpha \gamma} g^{\beta \delta} . \tag{40.64}\\ \frac{\partial \mathcal{L}}{\partial g^{\alpha \gamma}}=-\frac{1}{2} F_{\alpha \beta} F_{\gamma \delta} g^{\beta \delta} . \tag{40.65} \end{gather*}(40.64)Lm=14FαβFγδgαγgβδ.(40.65)Lgαγ=12FαβFγδgβδ.
We have
The energy-momentum tensor is, therefore,
(40.66) T α γ = ( F α β F γ δ g β δ 1 4 g α γ F σ ρ F μ ν g σ μ g ρ ν ) (40.66) T α γ = F α β F γ δ g β δ 1 4 g α γ F σ ρ F μ ν g σ μ g ρ ν {:(40.66)T_(alpha gamma)=(F_(alpha beta)F_(gamma delta)g^(beta delta)-(1)/(4)g_(alpha gamma)F_(sigma rho)F_(mu nu)g^(sigma mu)g^(rho nu)):}\begin{equation*} T_{\alpha \gamma}=\left(F_{\alpha \beta} F_{\gamma \delta} g^{\beta \delta}-\frac{1}{4} g_{\alpha \gamma} F_{\sigma \rho} F_{\mu \nu} g^{\sigma \mu} g^{\rho \nu}\right) \tag{40.66} \end{equation*}(40.66)Tαγ=(FαβFγδgβδ14gαγFσρFμνgσμgρν)
The previous example shows how we can extract the energy-momentum tensor for the fields in the Universe. An exception is gravitation itself. In the same way that general relativity removes the notion of a gravitational force, it also prevents us from identifying an energy-moment tensor for a gravitational field.

40.6 Noether's theorem

One of the most profound statements in field theory is Noether's the-
20 20 ^(20){ }^{20}20 Amalie Emmy Noether (1882-1935). orem. 20 20 ^(20){ }^{20}20 It says that if a field theory has a symmetry then there is a corresponding conservation equation corresponding to a conserved quantity. Noether's theorem elegantly reveals these conservation equations and will consider how it can be used on the metric field.
Consider the differential of the Lagrangian in Minkowski space
(40.67) δ L = L ϕ δ ϕ + L ( μ ϕ ) δ ( μ ϕ ) + L η μ ν δ η μ ν (40.67) δ L = L ϕ δ ϕ + L μ ϕ δ μ ϕ + L η μ ν δ η μ ν {:(40.67)deltaL=(delL)/(del phi)delta phi+(delL)/(del(del_(mu)phi))delta(del_(mu)phi)+(delL)/(deleta_(mu nu))deltaeta_(mu nu):}\begin{equation*} \delta \mathcal{L}=\frac{\partial \mathcal{L}}{\partial \phi} \delta \phi+\frac{\partial \mathcal{L}}{\partial\left(\partial_{\mu} \phi\right)} \delta\left(\partial_{\mu} \phi\right)+\frac{\partial \mathcal{L}}{\partial \eta_{\mu \nu}} \delta \eta_{\mu \nu} \tag{40.67} \end{equation*}(40.67)δL=Lϕδϕ+L(μϕ)δ(μϕ)+Lημνδημν
If we make a continuous transformation of the variables in the La-
21 21 ^(21){ }^{21}21 Strictly, if it doesn't change up to a surface term. grangian and find that the action doesn't change 21 21 ^(21){ }^{21}21 we say that the transformation represents a symmetry. In order to examine the sym- metries corresponding to the geometric features of the theory, we are
going to pull this Lagrangian along a vector ξ ξ xi\boldsymbol{\xi}ξ and compare the result with our original Lagrangian.
The mathematical tool for comparing a field after it has been transported by a vector is, of course, the Lie derivative. We therefore upgrade the differential δ L δ L deltaL\delta \mathcal{L}δL to a Lie derivative
(40.68) δ L = £ ξ L = ξ α α L (40.68) δ L = £ ξ L = ξ α α L {:(40.68)deltaL=£_(xi)L=xi^(alpha)del_(alpha)L:}\begin{equation*} \delta \mathcal{L}=£_{\xi} \mathcal{L}=\xi^{\alpha} \partial_{\alpha} \mathcal{L} \tag{40.68} \end{equation*}(40.68)δL=£ξL=ξααL
where the final equality holds because L L L\mathcal{L}L is a scalar function. Exchanging δ δ delta\deltaδ for the Lie derivative, eqn 40.67 becomes
(40.69) ξ α α L = L ϕ £ ξ ϕ + L ( μ ϕ ) £ ξ ( μ ϕ ) + L η μ ν £ ξ η μ ν . (40.69) ξ α α L = L ϕ £ ξ ϕ + L μ ϕ £ ξ μ ϕ + L η μ ν £ ξ η μ ν . {:(40.69)xi^(alpha)del_(alpha)L=(delL)/(del phi)£_(xi)phi+(delL)/(del(del_(mu)phi))£_(xi)(del_(mu)phi)+(delL)/(deleta_(mu nu))£_(xi)eta_(mu nu).:}\begin{equation*} \xi^{\alpha} \partial_{\alpha} \mathcal{L}=\frac{\partial \mathcal{L}}{\partial \phi} £_{\boldsymbol{\xi}} \phi+\frac{\partial \mathcal{L}}{\partial\left(\partial_{\mu} \phi\right)} £_{\boldsymbol{\xi}}\left(\partial_{\mu} \phi\right)+\frac{\partial \mathcal{L}}{\partial \eta_{\mu \nu}} £_{\boldsymbol{\xi}} \eta_{\mu \nu} . \tag{40.69} \end{equation*}(40.69)ξααL=Lϕ£ξϕ+L(μϕ)£ξ(μϕ)+Lημν£ξημν.
We have seen that in examining geometry (encoded in the metric field), symmetries are described in terms of isometries in the metric, which give rise to Killing fields. So we set ξ ξ xi\boldsymbol{\xi}ξ to be a Killing field, defined by £ ξ η α β = 0 £ ξ η α β = 0 £_(xi)eta_(alpha beta)=0£_{\xi} \eta_{\alpha \beta}=0£ξηαβ=0, which wipes out the final term in eqn 40.69 . We can then use the Euler-Lagrange equation 22 22 ^(22){ }^{22}22 to substitute for L / ϕ L / ϕ delL//del phi\partial \mathcal{L} / \partial \phiL/ϕ. Rearranging then gives
(40.71) α [ L ( α ϕ ) £ ξ ϕ ξ α L ] = 0 (40.71) α L α ϕ £ ξ ϕ ξ α L = 0 {:(40.71)del_(alpha)[(delL)/(del(del_(alpha)phi))£_(xi)phi-xi^(alpha)L]=0:}\begin{equation*} \partial_{\alpha}\left[\frac{\partial \mathcal{L}}{\partial\left(\partial_{\alpha} \phi\right)} £_{\xi} \phi-\xi^{\alpha} \mathcal{L}\right]=0 \tag{40.71} \end{equation*}(40.71)α[L(αϕ)£ξϕξαL]=0
This is a conservation equation 23 23 ^(23){ }^{23}23 that tells us the quantity in the bracket is conserved. This is Noether's theorem: a symmetry (identified via the Killing vector ξ ξ xi\boldsymbol{\xi}ξ ) has given rise to a conserved Noether current J N J N J_(N)\boldsymbol{J}_{\mathrm{N}}JN with components
(40.72) J N α = L ( α ϕ ) £ ξ ϕ ξ α L (40.72) J N α = L α ϕ £ ξ ϕ ξ α L {:(40.72)J_(N)^(alpha)=(delL)/(del(del_(alpha)phi))£_(xi)phi-xi^(alpha)L:}\begin{equation*} J_{\mathrm{N}}^{\alpha}=\frac{\partial \mathcal{L}}{\partial\left(\partial_{\alpha} \phi\right)} £_{\boldsymbol{\xi}} \phi-\xi^{\alpha} \mathcal{L} \tag{40.72} \end{equation*}(40.72)JNα=L(αϕ)£ξϕξαL
In particular, we note that for Minkowski space that we're considering, translation vectors in each of the Cartesian directions are Killing vectors.

Example 40.13

Translations in the x x xxx-direction leave the metric unchanged and so we have a Killing vector
(40.73) ξ = e x = x . (40.73) ξ = e x = x . {:(40.73)xi=e_(x)=(del)/(del x).:}\begin{equation*} \boldsymbol{\xi}=\boldsymbol{e}_{x}=\frac{\partial}{\partial x} . \tag{40.73} \end{equation*}(40.73)ξ=ex=x.
This gives a Noether current with components
(40.74) J N α = L ( α ϕ ) ϕ x δ α x L (40.74) J N α = L α ϕ ϕ x δ α x L {:(40.74)J_(N)^(alpha)=(delL)/(del(del_(alpha)phi))*(del phi)/(del x)-delta^(alpha)_(x)L:}\begin{equation*} J_{\mathrm{N}}^{\alpha}=\frac{\partial \mathcal{L}}{\partial\left(\partial_{\alpha} \phi\right)} \cdot \frac{\partial \phi}{\partial x}-\delta^{\alpha}{ }_{x} \mathcal{L} \tag{40.74} \end{equation*}(40.74)JNα=L(αϕ)ϕxδαxL
We can group the four Noether currents that result from translations into a single tensor S S S\boldsymbol{S}S. We call this quantity the canonical energymomentum tensor. It has components
(40.75) S α β = L ( α ϕ ) β ϕ g α β L (40.75) S α β = L α ϕ β ϕ g α β L {:(40.75)S^(alpha beta)=(delL)/(del(del_(alpha)phi))del^(beta)phi-g^(alpha beta)L:}\begin{equation*} S^{\alpha \beta}=\frac{\partial \mathcal{L}}{\partial\left(\partial_{\alpha} \phi\right)} \partial^{\beta} \phi-g^{\alpha \beta} \mathcal{L} \tag{40.75} \end{equation*}(40.75)Sαβ=L(αϕ)βϕgαβL
so that α S α β = 0 α S α β = 0 del_(alpha)S^(alpha beta)=0\partial_{\alpha} S^{\alpha \beta}=0αSαβ=0.
The tensors S S S\boldsymbol{S}S and the energy-momentum tensor T T T\boldsymbol{T}T are the same for scalar fields, but differ for more complicated matter fields. Generally, S S SSS isn't symmetrical and doesn't generalize for curved spacetimes, so is generally unsuitable for incorporation into Einstein's equation. 24 24 ^(24){ }^{24}24
22 22 ^(22){ }^{22}22 The Euler-Lagrange equation is
(40.70) L ϕ = α L ( α ϕ ) (40.70) L ϕ = α L α ϕ {:(40.70)(delL)/(del phi)=del_(alpha)(delL)/(del(del_(alpha)phi)):}\begin{equation*} \frac{\partial \mathcal{L}}{\partial \phi}=\partial_{\alpha} \frac{\partial \mathcal{L}}{\partial\left(\partial_{\alpha} \phi\right)} \tag{40.70} \end{equation*}(40.70)Lϕ=αL(αϕ)
23 23 ^(23){ }^{23}23 This is an equation of continuity, telling us that the divergence of something is zero. In some cases, we may thing is zero. In some cases, we may
have to worry about the global boundhave to worry about the global bound-
ary conditions to show that there is no effect of a surface term. We will not worry about that here.
24 24 ^(24){ }^{24}24 The need for a symmetrical energymomentum tensor is discussed in the exercises.
\curvearrowright Since the algebra is rather involved, this argument is left out of most books. The techout of most books. The tech-
nical detail is all contained in Example 40.14, which can be Example 40.14, which can be skipped if so desired.

40.7 The perfect fluid

Fluids were examined in detail in Chapter 39. Here we show how the perfect fluid's equation of motion and energy-momentum tensor can be derived directly from a Lagrangian. The perfect fluid has no viscosity and moves adiabatically. In field theory, a fluid is described by two features: (i) its mass-density field ρ 0 ( x ) ρ 0 ( x ) rho_(0)(x)\rho_{0}(x)ρ0(x) and (ii) a congruence of curves called flow lines (or stream lines) with timelike tangent vectors w = τ w = τ w=(del)/(del tau)\boldsymbol{w}=\frac{\partial}{\partial \tau}w=τ. When normalized, the tangent w w w\boldsymbol{w}w becomes the velocity v v v\boldsymbol{v}v field of the fluid. That is to say
(40.76) v = w [ g ( w , w ) ] 1 2 (40.76) v = w [ g ( w , w ) ] 1 2 {:(40.76)v=(w)/([-g(w,w)]^((1)/(2))):}\begin{equation*} \boldsymbol{v}=\frac{\boldsymbol{w}}{[-\boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w})]^{\frac{1}{2}}} \tag{40.76} \end{equation*}(40.76)v=w[g(w,w)]12
so that v 2 = g ( v , v ) = 1 v 2 = g ( v , v ) = 1 v^(2)=g(v,v)=-1\boldsymbol{v}^{2}=\boldsymbol{g}(\boldsymbol{v}, \boldsymbol{v})=-1v2=g(v,v)=1, as we require for a velocity field. The mass current is J = ρ 0 v J = ρ 0 v J=rho_(0)v\boldsymbol{J}=\rho_{0} \boldsymbol{v}J=ρ0v and we impose the condition that J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0 or equivalently J α ; α = 0 J α ; α = 0 J^(alpha)_(;alpha)=0J^{\alpha}{ }_{; \alpha}=0Jα;α=0, which is to say that the current is conserved. The fluid has a Lagrangian
(40.77) L m = ρ = ρ 0 ( 1 + u ) , (40.77) L m = ρ = ρ 0 ( 1 + u ) , {:(40.77)L_(m)=-rho=-rho_(0)(1+u)",":}\begin{equation*} \mathcal{L}_{\mathrm{m}}=-\rho=-\rho_{0}(1+u), \tag{40.77} \end{equation*}(40.77)Lm=ρ=ρ0(1+u),
where u u uuu is the specific internal energy, which is a function of ρ 0 ρ 0 rho_(0)\rho_{0}ρ0 [that is u = u ( ρ 0 ) ] . 25 u = u ρ 0 . 25 {:u=u(rho_(0))].^(25)\left.u=u\left(\rho_{0}\right)\right] .{ }^{25}u=u(ρ0)].25 The quantity ρ = ρ 0 ( 1 + u ) ρ = ρ 0 ( 1 + u ) rho=rho_(0)(1+u)\rho=\rho_{0}(1+u)ρ=ρ0(1+u) the total energy density of the fluid.
To derive the equations of motion we need to vary the action, which involves varying the flow lines. This changes v v v\boldsymbol{v}v, so while we vary the flow lines, we imagine adjusting ρ 0 ρ 0 rho_(0)\rho_{0}ρ0 to ensure that the current J J J\boldsymbol{J}J is conserved throughout. This enables us to continue using the conservation law. The variation of the flow lines can be followed by defining K K K\boldsymbol{K}K as the displacement vector of a point on a flow-line as we vary the flow. How much the flow has changed can then be measured by carrying the flow pattern along the vector field K K K\boldsymbol{K}K and comparing the tangent fields before and after. That is to say, the effect of the variation is contained in the variation in w w w\boldsymbol{w}w, which is measured by a Lie derivative
(40.79) δ w = £ K w (40.79) δ w = £ K w {:(40.79)delta w=£_(K)w:}\begin{equation*} \delta \boldsymbol{w}=£_{\boldsymbol{K}} \boldsymbol{w} \tag{40.79} \end{equation*}(40.79)δw=£Kw
What follows is a long computation that we split into several steps. 26 26 ^(26){ }^{26}26
Example 40.14
Step 0: Write the variation in S S SSS in terms of a differential δ ρ 0 δ ρ 0 deltarho_(0)\delta \rho_{0}δρ0 in the usual way, by saying
δ S = d 4 x g d L d ρ 0 δ ρ 0 (40.80) = d 4 x g ( 1 + d ( ρ 0 u ) d ρ 0 ) δ ρ 0 = 0 . δ S = d 4 x g d L d ρ 0 δ ρ 0 (40.80) = d 4 x g 1 + d ρ 0 u d ρ 0 δ ρ 0 = 0 . {:[delta S=-intd^(4)xsqrt(-g)((d)L)/((d)rho_(0))deltarho_(0)],[(40.80)=-intd^(4)xsqrt(-g)(1+(d(rho_(0)u))/(drho_(0)))deltarho_(0)=0.]:}\begin{align*} \delta S & =-\int \mathrm{d}^{4} x \sqrt{-g} \frac{\mathrm{~d} \mathcal{L}}{\mathrm{~d} \rho_{0}} \delta \rho_{0} \\ & =-\int \mathrm{d}^{4} x \sqrt{-g}\left(1+\frac{\mathrm{d}\left(\rho_{0} u\right)}{\mathrm{d} \rho_{0}}\right) \delta \rho_{0}=0 . \tag{40.80} \end{align*}δS=d4xg dL dρ0δρ0(40.80)=d4xg(1+d(ρ0u)dρ0)δρ0=0.
Step I: In order to use the expression for δ S δ S delta S\delta SδS, we need an expression for δ ρ 0 δ ρ 0 deltarho_(0)\delta \rho_{0}δρ0. One can be found from our conservation of current equation. Since δ ( J α ; α ) = 0 = ( δ J α ) ; α δ J α ; α = 0 = δ J α ; α delta(J^(alpha)_(;alpha))=0=(deltaJ^(alpha))_(;alpha)\delta\left(J^{\alpha}{ }_{; \alpha}\right)=0=\left(\delta J^{\alpha}\right)_{; \alpha}δ(Jα;α)=0=(δJα);α we have
( δ J α ) ; α = [ ( δ ρ 0 ) v α + ρ 0 δ v α ] ; α (40.81) = ( δ ρ 0 ) ; α v α + δ ρ 0 v ; α α + ( ρ 0 ) ; α δ v α + ρ 0 ( δ v α ) ; α = 0 . δ J α ; α = δ ρ 0 v α + ρ 0 δ v α ; α (40.81) = δ ρ 0 ; α v α + δ ρ 0 v ; α α + ρ 0 ; α δ v α + ρ 0 δ v α ; α = 0 . {:[(deltaJ^(alpha))_(;alpha)=[(deltarho_(0))v^(alpha)+rho_(0)deltav^(alpha)]_(;alpha)],[(40.81)=(deltarho_(0))_(;alpha)v^(alpha)+deltarho_(0)v_(;alpha)^(alpha)+(rho_(0))_(;alpha)deltav^(alpha)+rho_(0)(deltav^(alpha))_(;alpha)=0.]:}\begin{align*} \left(\delta J^{\alpha}\right)_{; \alpha} & =\left[\left(\delta \rho_{0}\right) v^{\alpha}+\rho_{0} \delta v^{\alpha}\right]_{; \alpha} \\ & =\left(\delta \rho_{0}\right)_{; \alpha} v^{\alpha}+\delta \rho_{0} v_{; \alpha}^{\alpha}+\left(\rho_{0}\right)_{; \alpha} \delta v^{\alpha}+\rho_{0}\left(\delta v^{\alpha}\right)_{; \alpha}=0 . \tag{40.81} \end{align*}(δJα);α=[(δρ0)vα+ρ0δvα];α(40.81)=(δρ0);αvα+δρ0v;αα+(ρ0);αδvα+ρ0(δvα);α=0.
Step II: In order to use the previous equation, we next need an expression for δ v δ v delta v\delta \boldsymbol{v}δv. This can be extracted from δ w δ w delta w\delta \boldsymbol{w}δw, which was given by the Lie derivative. So since v = [ g ( w , w ) ] 1 2 w v = [ g ( w , w ) ] 1 2 w v=[-g(w,w)]^(-(1)/(2))w\boldsymbol{v}=[-\boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w})]^{-\frac{1}{2}} \boldsymbol{w}v=[g(w,w)]12w we have, using the chain rule,
(40.82) δ v α = 1 2 [ g ( w , w ) ] 3 2 [ δ g ( w , w ) ] w α + [ g ( w , w ) ] 1 2 δ w α . (40.82) δ v α = 1 2 [ g ( w , w ) ] 3 2 [ δ g ( w , w ) ] w α + [ g ( w , w ) ] 1 2 δ w α . {:(40.82)deltav^(alpha)=(1)/(2)[-g(w","w)]^(-(3)/(2))[delta g(w","w)]w^(alpha)+[-g(w","w)]^(-(1)/(2))deltaw^(alpha).:}\begin{equation*} \delta v^{\alpha}=\frac{1}{2}[-\boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w})]^{-\frac{3}{2}}[\delta \boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w})] w^{\alpha}+[-\boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w})]^{-\frac{1}{2}} \delta w^{\alpha} . \tag{40.82} \end{equation*}(40.82)δvα=12[g(w,w)]32[δg(w,w)]wα+[g(w,w)]12δwα.
This is simplified by considering the term
δ g ( w , w ) = 2 g ( w , w ) w γ δ w γ (40.83) = 2 g β γ w β δ w γ δ g ( w , w ) = 2 g ( w , w ) w γ δ w γ (40.83) = 2 g β γ w β δ w γ {:[delta g(w","w)=2(del g(w,w))/(delw^(gamma))deltaw^(gamma)],[(40.83)=2g_(beta gamma)w^(beta)deltaw^(gamma)]:}\begin{align*} \delta \boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w}) & =2 \frac{\partial \boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w})}{\partial w^{\gamma}} \delta \boldsymbol{w}^{\gamma} \\ & =2 g_{\beta \gamma} \boldsymbol{w}^{\beta} \delta w^{\gamma} \tag{40.83} \end{align*}δg(w,w)=2g(w,w)wγδwγ(40.83)=2gβγwβδwγ
We note further that, by definition of the Lie derivative, we have that
(40.84) δ w α = ( £ K w ) α = w ; β α K β K ; β α w β (40.84) δ w α = £ K w α = w ; β α K β K ; β α w β {:(40.84)deltaw^(alpha)=(£_(K)w)^(alpha)=w_(;beta)^(alpha)K^(beta)-K_(;beta)^(alpha)w^(beta):}\begin{equation*} \delta w^{\alpha}=\left(£_{\boldsymbol{K}} \boldsymbol{w}\right)^{\alpha}=w_{; \beta}^{\alpha} K^{\beta}-K_{; \beta}^{\alpha} w^{\beta} \tag{40.84} \end{equation*}(40.84)δwα=(£Kw)α=w;βαKβK;βαwβ
Putting this together gives the result of Step II: an expression for δ v α δ v α deltav^(alpha)\delta v^{\alpha}δvα, which is
δ v α = 1 2 [ g ( w , w ) ] 3 2 2 g β γ w β ( w γ ; σ K σ K ; σ γ w σ ) w α + [ g ( w , w ) ] 1 2 ( w α ; β K β K ; β α w α ) (40.85) = v α ; β K β K ; β α v β v α v β K β ; γ v γ , δ v α = 1 2 [ g ( w , w ) ] 3 2 2 g β γ w β w γ ; σ K σ K ; σ γ w σ w α + [ g ( w , w ) ] 1 2 w α ; β K β K ; β α w α (40.85) = v α ; β K β K ; β α v β v α v β K β ; γ v γ , {:[deltav^(alpha)=(1)/(2)[-g(w","w)]^(-(3)/(2))2g_(beta gamma)w^(beta)(w^(gamma)_(;sigma)K^(sigma)-K_(;sigma)^(gamma)w^(sigma))w^(alpha)],[+[-g(w","w)]^(-(1)/(2))(w^(alpha)_(;beta)K^(beta)-K_(;beta)^(alpha)w^(alpha))],[(40.85)=v^(alpha)_(;beta)K^(beta)-K_(;beta)^(alpha)v^(beta)-v^(alpha)v^(beta)K_(beta;gamma)v^(gamma)","]:}\begin{align*} \delta v^{\alpha}= & \frac{1}{2}[-\boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w})]^{-\frac{3}{2}} 2 g_{\beta \gamma} w^{\beta}\left(w^{\gamma}{ }_{; \sigma} K^{\sigma}-K_{; \sigma}^{\gamma} w^{\sigma}\right) w^{\alpha} \\ & +[-\boldsymbol{g}(\boldsymbol{w}, \boldsymbol{w})]^{-\frac{1}{2}}\left(w^{\alpha}{ }_{; \beta} K^{\beta}-K_{; \beta}^{\alpha} w^{\alpha}\right) \\ = & v^{\alpha}{ }_{; \beta} K^{\beta}-K_{; \beta}^{\alpha} v^{\beta}-v^{\alpha} v^{\beta} K_{\beta ; \gamma} v^{\gamma}, \tag{40.85} \end{align*}δvα=12[g(w,w)]322gβγwβ(wγ;σKσK;σγwσ)wα+[g(w,w)]12(wα;βKβK;βαwα)(40.85)=vα;βKβK;βαvβvαvβKβ;γvγ,
where, in the final line, we've written everything in terms of components of v v v\boldsymbol{v}v and the displacement field K K K\boldsymbol{K}K.
Step III: Substitute for δ v α δ v α deltav^(alpha)\delta v^{\alpha}δvα in eqn 40.81 (which, remember, only describes the conservation law J α ; α = 0 J α ; α = 0 J^(alpha)_(;alpha)=0J^{\alpha}{ }_{; \alpha}=0Jα;α=0 ). We find that the number of terms seems to increase rather alarmingly! Specifically, we find
0 = ( δ ρ 0 ) ; α v α + δ ρ 0 v α ; α + ( ρ 0 ) ; α δ v α + ρ 0 ( δ v α ) ; α = ( δ ρ 0 ) ; α v α + δ ρ 0 v α ; α + ( ρ 0 ) ; α ( v α ; β K β K α ; β v β v α v β K β ; γ v γ ) + ρ 0 ( v α ; β K β K α ; β v β v α v β K β ; γ v γ ) ; α = ( δ ρ 0 ) ; α v α + δ ρ 0 v α ; α + ( ρ 0 ) ; α ( v α ; β K β K ; β α v β v α v β K β ; γ v γ ) + ρ 0 ( v α ; β α K β + v ; β α K ; α β K α ; β α v β K ; β α v β ; α v α ; α v β K β ; γ v γ v α v ; α β K β ; γ v γ v α v β K β ; γ α v γ v α v β K β ; γ v γ ; α ) . ( 40.86 ) 0 = δ ρ 0 ; α v α + δ ρ 0 v α ; α + ρ 0 ; α δ v α + ρ 0 δ v α ; α = δ ρ 0 ; α v α + δ ρ 0 v α ; α + ρ 0 ; α v α ; β K β K α ; β v β v α v β K β ; γ v γ + ρ 0 v α ; β K β K α ; β v β v α v β K β ; γ v γ ; α = δ ρ 0 ; α v α + δ ρ 0 v α ; α + ρ 0 ; α v α ; β K β K ; β α v β v α v β K β ; γ v γ + ρ 0 v α ; β α K β + v ; β α K ; α β K α ; β α v β K ; β α v β ; α v α ; α v β K β ; γ v γ v α v ; α β K β ; γ v γ v α v β K β ; γ α v γ v α v β K β ; γ v γ ; α . ( 40.86 ) {:[0=(deltarho_(0))_(;alpha)v^(alpha)+deltarho_(0)v^(alpha)_(;alpha)+(rho_(0))_(;alpha)deltav^(alpha)+rho_(0)(deltav^(alpha))_(;alpha)],[=(deltarho_(0))_(;alpha)v^(alpha)+deltarho_(0)v^(alpha)_(;alpha)+(rho_(0))_(;alpha)(v^(alpha)_(;beta)K^(beta)-K^(alpha)_(;beta)v^(beta)-v^(alpha)v^(beta)K_(beta;gamma)v^(gamma))],[+rho_(0)(v^(alpha)_(;beta)K^(beta)-K^(alpha)_(;beta)v^(beta)-v^(alpha)v^(beta)K_(beta;gamma)v^(gamma))_(;alpha)],[=(deltarho_(0))_(;alpha)v^(alpha)+deltarho_(0)v^(alpha)_(;alpha)+(rho_(0))_(;alpha)(v^(alpha)_(;beta)K^(beta)-K_(;beta)^(alpha)v^(beta)-v^(alpha)v^(beta)K_(beta;gamma)v^(gamma))],[+rho_(0)(v^(alpha)_(;beta alpha)K^(beta)+v_(;beta)^(alpha)K_(;alpha)^(beta)-K^(alpha)_(;beta alpha)v^(beta)-K_(;beta)^(alpha)v^(beta)_(;alpha):}],[{:-v^(alpha)_(;alpha)v^(beta)K_(beta;gamma)v^(gamma)-v^(alpha)v_(;alpha)^(beta)K_(beta;gamma)v^(gamma)-v^(alpha)v^(beta)K_(beta;gamma alpha)v^(gamma)-v^(alpha)v^(beta)K_(beta;gamma)v^(gamma)_(;alpha)).(40.86)]:}\begin{aligned} 0= & \left(\delta \rho_{0}\right)_{; \alpha} v^{\alpha}+\delta \rho_{0} v^{\alpha}{ }_{; \alpha}+\left(\rho_{0}\right)_{; \alpha} \delta v^{\alpha}+\rho_{0}\left(\delta v^{\alpha}\right)_{; \alpha} \\ = & \left(\delta \rho_{0}\right)_{; \alpha} v^{\alpha}+\delta \rho_{0} v^{\alpha}{ }_{; \alpha}+\left(\rho_{0}\right)_{; \alpha}\left(v^{\alpha}{ }_{; \beta} K^{\beta}-K^{\alpha}{ }_{; \beta} v^{\beta}-v^{\alpha} v^{\beta} K_{\beta ; \gamma} v^{\gamma}\right) \\ & +\rho_{0}\left(v^{\alpha}{ }_{; \beta} K^{\beta}-K^{\alpha}{ }_{; \beta} v^{\beta}-v^{\alpha} v^{\beta} K_{\beta ; \gamma} v^{\gamma}\right)_{; \alpha} \\ = & \left(\delta \rho_{0}\right)_{; \alpha} v^{\alpha}+\delta \rho_{0} v^{\alpha}{ }_{; \alpha}+\left(\rho_{0}\right)_{; \alpha}\left(v^{\alpha}{ }_{; \beta} K^{\beta}-K_{; \beta}^{\alpha} v^{\beta}-v^{\alpha} v^{\beta} K_{\beta ; \gamma} v^{\gamma}\right) \\ & +\rho_{0}\left(v^{\alpha}{ }_{; \beta \alpha} K^{\beta}+v_{; \beta}^{\alpha} K_{; \alpha}^{\beta}-K^{\alpha}{ }_{; \beta \alpha} v^{\beta}-K_{; \beta}^{\alpha} v^{\beta}{ }_{; \alpha}\right. \\ & \left.-v^{\alpha}{ }_{; \alpha} v^{\beta} K_{\beta ; \gamma} v^{\gamma}-v^{\alpha} v_{; \alpha}^{\beta} K_{\beta ; \gamma} v^{\gamma}-v^{\alpha} v^{\beta} K_{\beta ; \gamma \alpha} v^{\gamma}-v^{\alpha} v^{\beta} K_{\beta ; \gamma} v^{\gamma}{ }_{; \alpha}\right) .(40.86) \end{aligned}0=(δρ0);αvα+δρ0vα;α+(ρ0);αδvα+ρ0(δvα);α=(δρ0);αvα+δρ0vα;α+(ρ0);α(vα;βKβKα;βvβvαvβKβ;γvγ)+ρ0(vα;βKβKα;βvβvαvβKβ;γvγ);α=(δρ0);αvα+δρ0vα;α+(ρ0);α(vα;βKβK;βαvβvαvβKβ;γvγ)+ρ0(vα;βαKβ+v;βαK;αβKα;βαvβK;βαvβ;αvα;αvβKβ;γvγvαv;αβKβ;γvγvαvβKβ;γαvγvαvβKβ;γvγ;α).(40.86)
This looks truly formidable, but can be simplified dramatically if we integrate along the flow lines. The key here is that covariant derivatives (identified in the above using the semicolon notation) are directional derivatives along the flow lines. Integrating along the flow lines can then be thought of as undoing these covariant derivatives. Along the flow lines we have local mass conservation 27 27 ^(27){ }^{27}27 written as J α ; α = ( ρ 0 v α ) ; α = J α ; α = ρ 0 v α ; α = J^(alpha)_(;alpha)=(rho_(0)v^(alpha))_(;alpha)=J^{\alpha}{ }_{; \alpha}=\left(\rho_{0} v^{\alpha}\right)_{; \alpha}=Jα;α=(ρ0vα);α= 0 . It turns out that if we collect terms which contain a factor v α ; α v α ; α v^(alpha)_(;alpha)v^{\alpha}{ }_{; \alpha}vα;α and demand that these vanish, this provides the relationship between δ ρ 0 δ ρ 0 deltarho_(0)\delta \rho_{0}δρ0 and v α , ρ 0 v α , ρ 0 v^(alpha),rho_(0)v^{\alpha}, \rho_{0}vα,ρ0 and K α K α K^(alpha)K^{\alpha}Kα that we need. The instruction to integrate along the flow lines is especially important as it allows us to integrate those terms involving a covariant derivative by parts, effectively changing the positions of the semicolons. This allows us to massage the indices in such a way as to invoke ( ρ 0 v α ) ; α = 0 . 28 ρ 0 v α ; α = 0 . 28 (rho_(0)v^(alpha))_(;alpha)=0.^(28)\left(\rho_{0} v^{\alpha}\right)_{; \alpha}=0 .{ }^{28}(ρ0vα);α=0.28 The terms contributing a factor v α ; α v α ; α v^(alpha)_(;alpha)v^{\alpha}{ }_{; \alpha}vα;α are
(40.88) flow lines ( δ ρ 0 v ; α α + ρ 0 v ; β α α K β ρ 0 v ; α α v β K β ; γ v γ ) = 0 (40.88) flow lines  δ ρ 0 v ; α α + ρ 0 v ; β α α K β ρ 0 v ; α α v β K β ; γ v γ = 0 {:(40.88)int_("flow lines ")(deltarho_(0)v_(;alpha)^(alpha)+rho_(0)v_(;beta alpha)^(alpha)K^(beta)-rho_(0)v_(;alpha)^(alpha)v^(beta)K_(beta;gamma)v^(gamma))=0:}\begin{equation*} \int_{\text {flow lines }}\left(\delta \rho_{0} v_{; \alpha}^{\alpha}+\rho_{0} v_{; \beta \alpha}^{\alpha} K^{\beta}-\rho_{0} v_{; \alpha}^{\alpha} v^{\beta} K_{\beta ; \gamma} v^{\gamma}\right)=0 \tag{40.88} \end{equation*}(40.88)flow lines (δρ0v;αα+ρ0v;βααKβρ0v;ααvβKβ;γvγ)=0
It's not obvious that the second term is one of the ones we want, until we integrate it by parts to get ( ρ 0 K β ) ; β v α ; α ρ 0 K β ; β v α ; α -int(rho_(0)K^(beta))_(;beta)v^(alpha)_(;alpha)-\int\left(\rho_{0} K^{\beta}\right)_{; \beta} v^{\alpha}{ }_{; \alpha}(ρ0Kβ);βvα;α.
We have now collected all terms in the integrand with a factor v α ; α v α ; α v^(alpha)_(;alpha)v^{\alpha}{ }_{; \alpha}vα;α, and since the divergence of v α v α v^(alpha)v^{\alpha}vα does not vanish (unless the density is constant) this integrand must do. The result is an equation for δ ρ 0 δ ρ 0 deltarho_(0)\delta \rho_{0}δρ0 :
(40.89) δ ρ 0 = ( ρ 0 K β ) ; β + ρ 0 K β ; γ v β v γ (40.89) δ ρ 0 = ρ 0 K β ; β + ρ 0 K β ; γ v β v γ {:(40.89)deltarho_(0)=(rho_(0)K^(beta))_(;beta)+rho_(0)K_(beta;gamma)v^(beta)v^(gamma):}\begin{equation*} \delta \rho_{0}=\left(\rho_{0} K^{\beta}\right)_{; \beta}+\rho_{0} K_{\beta ; \gamma} v^{\beta} v^{\gamma} \tag{40.89} \end{equation*}(40.89)δρ0=(ρ0Kβ);β+ρ0Kβ;γvβvγ
We can plug this into eqn 40.80 , which expresses the variation of the action.
27 27 ^(27){ }^{27}27 In general, as we shall show below, they don't obey the geodesic equation and so v β ( v α ) ; β 0 v β v α ; β 0 v^(beta)(v^(alpha))_(;beta)!=0v^{\beta}\left(v^{\alpha}\right)_{; \beta} \neq 0vβ(vα);β0. This is due to the possibility of pressure gradients perpendicular to the flow lines in the fluid.
28 As 28 As ^(28)As{ }^{28} \mathrm{As}28As an example of integration by parts, consider
v ; β α α K β = [ v α ; β K β ] (40.87) v ; α α K ; β β , v ; β α α K β = v α ; β K β (40.87) v ; α α K ; β β , {:[intv_(;beta alpha)^(alpha)K^(beta)=[v^(alpha)_(;beta)K_(beta)]_(-oo)^(oo)],[(40.87)-intv_(;alpha)^(alpha)K_(;beta)^(beta)","]:}\begin{align*} \int v_{; \beta \alpha}^{\alpha} K^{\beta}= & {\left[v^{\alpha}{ }_{; \beta} K_{\beta}\right]_{-\infty}^{\infty} } \\ & -\int v_{; \alpha}^{\alpha} K_{; \beta}^{\beta}, \tag{40.87} \end{align*}v;βααKβ=[vα;βKβ](40.87)v;ααK;ββ,
where the integrals are carried out along the flow lines. The first term on the right vanishes as we demand that all fields die off to zero at infinity. This leave the second term. We see how the position of the ; β β beta\betaβ has changed and we have picked up a minus sign.
29 29 ^(29){ }^{29}29 This is eqn 39.77 from the last chapter.
30 30 ^(30){ }^{30}30 This is discussed in Exercise 34.3.
Step IV: The variation in the Lagrangian is therefore
(40.90) δ S = d 4 x g { [ ( ρ 0 K β ) ; β + ρ 0 K β ; γ v β v γ ] [ 1 + d ( ρ 0 u ) d ρ 0 ] } = 0 (40.90) δ S = d 4 x g ρ 0 K β ; β + ρ 0 K β ; γ v β v γ 1 + d ρ 0 u d ρ 0 = 0 {:(40.90)delta S=-intd^(4)xsqrt(-g){[(rho_(0)K^(beta))_(;beta)+rho_(0)K_(beta;gamma)v^(beta)v^(gamma)][1+(d(rho_(0)u))/(drho_(0))]}=0:}\begin{equation*} \delta S=-\int \mathrm{d}^{4} x \sqrt{-g}\left\{\left[\left(\rho_{0} K^{\beta}\right)_{; \beta}+\rho_{0} K_{\beta ; \gamma} v^{\beta} v^{\gamma}\right]\left[1+\frac{\mathrm{d}\left(\rho_{0} u\right)}{\mathrm{d} \rho_{0}}\right]\right\}=0 \tag{40.90} \end{equation*}(40.90)δS=d4xg{[(ρ0Kβ);β+ρ0Kβ;γvβvγ][1+d(ρ0u)dρ0]}=0
Step V: In the final step, we integrate this equation by parts, moving the semicolons from the factors of K β K β K^(beta)K^{\beta}Kβ onto the other terms, in such a way that they have a common factor of K β K β K^(beta)K^{\beta}Kβ. The result is
δ S = d 4 x g { ρ 0 [ 1 + d ( ρ 0 u ) d ρ 0 ] v α ; β v β + ρ 0 [ d ( ρ 0 u ) d ρ 0 ] , γ ( g γ α + v γ v α ) } K α = 0 δ S = d 4 x g ρ 0 1 + d ρ 0 u d ρ 0 v α ; β v β + ρ 0 d ρ 0 u d ρ 0 , γ g γ α + v γ v α K α = 0 delta S=intd^(4)xsqrt(-g){rho_(0)[1+(d(rho_(0)u))/(drho_(0))]v^(alpha)_(;beta)v^(beta)+rho_(0)[((d)(rho_(0)u))/(drho_(0))]_(,gamma)(g^(gamma alpha)+v^(gamma)v^(alpha))}K_(alpha)=0\delta S=\int \mathrm{d}^{4} x \sqrt{-g}\left\{\rho_{0}\left[1+\frac{\mathrm{d}\left(\rho_{0} u\right)}{\mathrm{d} \rho_{0}}\right] v^{\alpha}{ }_{; \beta} v^{\beta}+\rho_{0}\left[\frac{\mathrm{~d}\left(\rho_{0} u\right)}{\mathrm{d} \rho_{0}}\right]_{, \gamma}\left(g^{\gamma \alpha}+v^{\gamma} v^{\alpha}\right)\right\} K_{\alpha}=0δS=d4xg{ρ0[1+d(ρ0u)dρ0]vα;βvβ+ρ0[ d(ρ0u)dρ0],γ(gγα+vγvα)}Kα=0.
Since K α K α K_(alpha)K_{\alpha}Kα does not vanish, the integrand must do, and we have our result for the variation of the fluid action. Finally, we interpret v ˙ α = v α ; β v β v ˙ α = v α ; β v β v^(˙)^(alpha)=v^(alpha)_(;beta)v^(beta)\dot{v}^{\alpha}=v^{\alpha}{ }_{; \beta} v^{\beta}v˙α=vα;βvβ as the acceleration of the flow lines and recall the pressure of the fluid is given by p = ρ 0 2 u / ρ 0 p = ρ 0 2 u / ρ 0 p=rho_(0)^(2)del u//delrho_(0)p=\rho_{0}^{2} \partial u / \partial \rho_{0}p=ρ02u/ρ0.
We conclude that the equation of motion for the fluid is given by
(40.92) ( ρ + p ) v ˙ α = p , β ( g α β + v β v α ) (40.92) ( ρ + p ) v ˙ α = p , β g α β + v β v α {:(40.92)(rho+p)v^(˙)^(alpha)=-p_(,beta)(g^(alpha beta)+v^(beta)v^(alpha)):}\begin{equation*} (\rho+p) \dot{v}^{\alpha}=-p_{, \beta}\left(g^{\alpha \beta}+v^{\beta} v^{\alpha}\right) \tag{40.92} \end{equation*}(40.92)(ρ+p)v˙α=p,β(gαβ+vβvα)
where the energy density is written as ρ = ρ 0 ( 1 + u ) . 29 ρ = ρ 0 ( 1 + u ) . 29 rho=rho_(0)(1+u).^(29)\rho=\rho_{0}(1+u) .{ }^{29}ρ=ρ0(1+u).29 This the equation of motion for the fluid, Euler's equation, written for curved spacetime. As promised, the geodesic equation v ˙ α = 0 v ˙ α = 0 v^(˙)^(alpha)=0\dot{v}^{\alpha}=0v˙α=0 is not obeyed: the flow lines accelerate in response to transverse pressure gradients.
Next, we extract the energy-momentum tensor using out prescription from earlier (eqn 40.57) that the up components are given by T μ ν = T μ ν = T^(mu nu)=T^{\mu \nu}=Tμν= 2 δ L m δ g μ ν + g μ ν L m 2 δ L m δ g μ ν + g μ ν L m 2(deltaL_(m))/(deltag_(mu nu))+g^(mu nu)L_(m)2 \frac{\delta \mathcal{L}_{\mathrm{m}}}{\delta g_{\mu \nu}}+g^{\mu \nu} \mathcal{L}_{\mathrm{m}}2δLmδgμν+gμνLm.

Example 40.15

Starting with the matter Lagrangian L = ρ 0 ( 1 + u ) L = ρ 0 ( 1 + u ) L=-rho_(0)(1+u)\mathcal{L}=-\rho_{0}(1+u)L=ρ0(1+u), we have that
(40.93) ( L m g α β ) ρ 0 = ( L m ρ 0 ) g α β ( ρ 0 g α β ) L = ( 1 + u + ρ 0 d u d ρ 0 ) ρ 0 g α β (40.93) L m g α β ρ 0 = L m ρ 0 g α β ρ 0 g α β L = 1 + u + ρ 0 d u d ρ 0 ρ 0 g α β {:(40.93)((delL_(m))/(delg_(alpha beta)))_(rho_(0))=-((delL_(m))/(delrho_(0)))_(g_(alpha beta))((delrho_(0))/(delg_(alpha beta)))_(L)=(1+u+rho_(0)((d)u)/((d)rho_(0)))(delrho_(0))/(delg_(alpha beta)):}\begin{equation*} \left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial g_{\alpha \beta}}\right)_{\rho_{0}}=-\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial \rho_{0}}\right)_{g_{\alpha \beta}}\left(\frac{\partial \rho_{0}}{\partial g_{\alpha \beta}}\right)_{\mathcal{L}}=\left(1+u+\rho_{0} \frac{\mathrm{~d} u}{\mathrm{~d} \rho_{0}}\right) \frac{\partial \rho_{0}}{\partial g_{\alpha \beta}} \tag{40.93} \end{equation*}(40.93)(Lmgαβ)ρ0=(Lmρ0)gαβ(ρ0gαβ)L=(1+u+ρ0 du dρ0)ρ0gαβ
We need access to the final term. This is achieved with a trick: the divergence of the current can be written in terms of the determinant of the metric matrix as 30 30 ^(30){ }^{30}30
(40.94) ( J α ) ; α = 1 ( g ) 1 2 x α [ ( g ) 1 2 J α ] = 0 (40.94) J α ; α = 1 ( g ) 1 2 x α ( g ) 1 2 J α = 0 {:(40.94)(J^(alpha))_(;alpha)=(1)/((-g)^((1)/(2)))(del)/(delx^(alpha))[(-g)^((1)/(2))J^(alpha)]=0:}\begin{equation*} \left(J^{\alpha}\right)_{; \alpha}=\frac{1}{(-g)^{\frac{1}{2}}} \frac{\partial}{\partial x^{\alpha}}\left[(-g)^{\frac{1}{2}} J^{\alpha}\right]=0 \tag{40.94} \end{equation*}(40.94)(Jα);α=1(g)12xα[(g)12Jα]=0
This is helpful as it tells us that the quantity ( g ) 1 2 J α ( g ) 1 2 J α (-g)^((1)/(2))J^(alpha)(-g)^{\frac{1}{2}} J^{\alpha}(g)12Jα is unchanged as the metric is varied. Dotting two currents together we have
(40.95) ρ 0 2 = g 1 [ ( g ) 1 2 J α ( g ) 1 2 J β ) g α β ] , (40.95) ρ 0 2 = g 1 ( g ) 1 2 J α ( g ) 1 2 J β g α β , {:(40.95){:rho_(0)^(2)=g^(-1)[(-g)^((1)/(2))J^(alpha)(-g)^((1)/(2))J^(beta))g_(alpha beta)]",":}\begin{equation*} \left.\rho_{0}^{2}=g^{-1}\left[(-g)^{\frac{1}{2}} J^{\alpha}(-g)^{\frac{1}{2}} J^{\beta}\right) g_{\alpha \beta}\right], \tag{40.95} \end{equation*}(40.95)ρ02=g1[(g)12Jα(g)12Jβ)gαβ],
and so
2 ρ 0 δ ρ 0 = ( J α J β J γ J γ g α β ) δ g α β (40.96) = ρ 0 2 ( v α v β v γ v γ g α β ) δ g α β , 2 ρ 0 δ ρ 0 = J α J β J γ J γ g α β δ g α β (40.96) = ρ 0 2 v α v β v γ v γ g α β δ g α β , {:[2rho_(0)deltarho_(0)=(J^(alpha)J^(beta)-J^(gamma)J_(gamma)g^(alpha beta))deltag_(alpha beta)],[(40.96)=rho_(0)^(2)(v^(alpha)v^(beta)-v^(gamma)v_(gamma)g^(alpha beta))deltag_(alpha beta)","]:}\begin{align*} 2 \rho_{0} \delta \rho_{0} & =\left(J^{\alpha} J^{\beta}-J^{\gamma} J_{\gamma} g^{\alpha \beta}\right) \delta g_{\alpha \beta} \\ & =\rho_{0}^{2}\left(v^{\alpha} v^{\beta}-v^{\gamma} v_{\gamma} g^{\alpha \beta}\right) \delta g_{\alpha \beta}, \tag{40.96} \end{align*}2ρ0δρ0=(JαJβJγJγgαβ)δgαβ(40.96)=ρ02(vαvβvγvγgαβ)δgαβ,
giving
(40.97) 2 δ ρ 0 = ρ 0 ( v α v β + g α β ) δ g α β (40.97) 2 δ ρ 0 = ρ 0 v α v β + g α β δ g α β {:(40.97)2deltarho_(0)=rho_(0)(v^(alpha)v^(beta)+g^(alpha beta))deltag_(alpha beta):}\begin{equation*} 2 \delta \rho_{0}=\rho_{0}\left(v^{\alpha} v^{\beta}+g^{\alpha \beta}\right) \delta g_{\alpha \beta} \tag{40.97} \end{equation*}(40.97)2δρ0=ρ0(vαvβ+gαβ)δgαβ
Using δ ρ 0 = ρ 0 g α β δ g α β δ ρ 0 = ρ 0 g α β δ g α β deltarho_(0)=(delrho_(0))/(delg_(alpha beta))deltag_(alpha beta)\delta \rho_{0}=\frac{\partial \rho_{0}}{\partial g_{\alpha \beta}} \delta g_{\alpha \beta}δρ0=ρ0gαβδgαβ, we extract the result we were after
(40.98) ρ 0 g α β = 1 2 ρ 0 ( v α v β + g α β ) (40.98) ρ 0 g α β = 1 2 ρ 0 v α v β + g α β {:(40.98)(delrho_(0))/(delg_(alpha beta))=(1)/(2)rho_(0)(v^(alpha)v^(beta)+g^(alpha beta)):}\begin{equation*} \frac{\partial \rho_{0}}{\partial g_{\alpha \beta}}=\frac{1}{2} \rho_{0}\left(v^{\alpha} v^{\beta}+g^{\alpha \beta}\right) \tag{40.98} \end{equation*}(40.98)ρ0gαβ=12ρ0(vαvβ+gαβ)
The first term in the energy-momentum tensor becomes
(40.99) 2 δ L m δ g α β = ( 1 + u + ρ 0 d u d ρ 0 ) ρ 0 ( v α v β + g α β ) (40.99) 2 δ L m δ g α β = 1 + u + ρ 0 d u d ρ 0 ρ 0 v α v β + g α β {:(40.99)2(deltaL_(m))/(deltag_(alpha beta))=(1+u+rho_(0)((d)u)/((d)rho_(0)))rho_(0)(v^(alpha)v^(beta)+g^(alpha beta)):}\begin{equation*} 2 \frac{\delta \mathcal{L}_{\mathrm{m}}}{\delta g_{\alpha \beta}}=\left(1+u+\rho_{0} \frac{\mathrm{~d} u}{\mathrm{~d} \rho_{0}}\right) \rho_{0}\left(v^{\alpha} v^{\beta}+g^{\alpha \beta}\right) \tag{40.99} \end{equation*}(40.99)2δLmδgαβ=(1+u+ρ0 du dρ0)ρ0(vαvβ+gαβ)
We add to this the term
(40.100) g α β L m = ρ 0 ( 1 + u ) g α β (40.100) g α β L m = ρ 0 ( 1 + u ) g α β {:(40.100)g^(alpha beta)L_(m)=-rho_(0)(1+u)g^(alpha beta):}\begin{equation*} g^{\alpha \beta} \mathcal{L}_{\mathrm{m}}=-\rho_{0}(1+u) g^{\alpha \beta} \tag{40.100} \end{equation*}(40.100)gαβLm=ρ0(1+u)gαβ
Putting this all together, we conclude that the components of the energymomentum tensor are given by
(40.101) T α β = [ ρ 0 ( 1 + u ) + ρ 0 2 d u d ρ 0 ] v α v β + ρ 0 2 d u d ρ 0 g α β . (40.101) T α β = ρ 0 ( 1 + u ) + ρ 0 2 d u d ρ 0 v α v β + ρ 0 2 d u d ρ 0 g α β . {:(40.101)T^(alpha beta)=[rho_(0)(1+u)+rho_(0)^(2)((d)u)/((d)rho_(0))]v^(alpha)v^(beta)+rho_(0)^(2)((d)u)/((d)rho_(0))g^(alpha beta).:}\begin{equation*} T^{\alpha \beta}=\left[\rho_{0}(1+u)+\rho_{0}^{2} \frac{\mathrm{~d} u}{\mathrm{~d} \rho_{0}}\right] v^{\alpha} v^{\beta}+\rho_{0}^{2} \frac{\mathrm{~d} u}{\mathrm{~d} \rho_{0}} g^{\alpha \beta} . \tag{40.101} \end{equation*}(40.101)Tαβ=[ρ0(1+u)+ρ02 du dρ0]vαvβ+ρ02 du dρ0gαβ.
We may reinterpret this using the symbols used above, to find the components of the energy-momentum tensor for the perfect fluid are given by
(40.102) T α β = ( ρ + p ) v α v β + p g α β . (40.102) T α β = ( ρ + p ) v α v β + p g α β . {:(40.102)T^(alpha beta)=(rho+p)v^(alpha)v^(beta)+pg^(alpha beta).:}\begin{equation*} T^{\alpha \beta}=(\rho+p) v^{\alpha} v^{\beta}+p g^{\alpha \beta} . \tag{40.102} \end{equation*}(40.102)Tαβ=(ρ+p)vαvβ+pgαβ.
This completes our review of the Lagrangian formulation. We shall use it in the next two chapters, where we use it in thinking about symmetry breaking in cosmology (Chapter 41) and then meet one of the most successful field theories: electromagnetism (Chapter 42).

Chapter summary

  • Equations of motion for matter fields can be found using
(40.103) L m A ν = ( L m A ν ; μ ) ; μ . (40.103) L m A ν = L m A ν ; μ ; μ . {:(40.103)(delL_(m))/(delA^(nu))=((delL_(m))/(delA^(nu)_(;mu)))_(;mu).:}\begin{equation*} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial A^{\nu}}=\left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial A^{\nu}{ }_{; \mu}}\right)_{; \mu} . \tag{40.103} \end{equation*}(40.103)LmAν=(LmAν;μ);μ.
  • The Einstein-Hilbert action is given by
(40.104) S EH = d 4 x g ( x ) R ( x ) . (40.104) S EH = d 4 x g ( x ) R ( x ) . {:(40.104)S_(EH)=intd^(4)xsqrt(-g(x))R(x).:}\begin{equation*} S_{\mathrm{EH}}=\int \mathrm{d}^{4} x \sqrt{-g(x)} R(x) . \tag{40.104} \end{equation*}(40.104)SEH=d4xg(x)R(x).
  • The components of the energy-momentum tensor can be found using the expression
(40.105) T μ ν = 2 δ L m δ g μ ν + g μ ν L m (40.105) T μ ν = 2 δ L m δ g μ ν + g μ ν L m {:(40.105)T^(mu nu)=2(deltaL_(m))/(deltag_(mu nu))+g^(mu nu)L_(m):}\begin{equation*} T^{\mu \nu}=2 \frac{\delta \mathcal{L}_{\mathrm{m}}}{\delta g_{\mu \nu}}+g^{\mu \nu} \mathcal{L}_{\mathrm{m}} \tag{40.105} \end{equation*}(40.105)Tμν=2δLmδgμν+gμνLm

Exercises

(40.1) Consider the Lagrangian for a particle of unit mass: L ( x μ , x ˙ μ ) = 1 2 g μ ν x ˙ μ x ˙ ν L x μ , x ˙ μ = 1 2 g μ ν x ˙ μ x ˙ ν L(x^(mu),x^(˙)^(mu))=(1)/(2)g_(mu nu)x^(˙)^(mu)x^(˙)^(nu)L\left(x^{\mu}, \dot{x}^{\mu}\right)=\frac{1}{2} g_{\mu \nu} \dot{x}^{\mu} \dot{x}^{\nu}L(xμ,x˙μ)=12gμνx˙μx˙ν, where dots denote a derivative with respect to proper time τ τ tau\tauτ.
(a) Confirm that the equation of motion for the particle is the geodesic law.
Recall from Chapter 2 that we define the canonical momentum as
(40.106) p μ = L x ˙ μ , (40.106) p μ = L x ˙ μ , {:(40.106)p_(mu)=(del L)/(delx^(˙)^(mu))",":}\begin{equation*} p_{\mu}=\frac{\partial L}{\partial \dot{x}^{\mu}}, \tag{40.106} \end{equation*}(40.106)pμ=Lx˙μ,
and the Hamiltonian as the function
(40.107) H ( x μ , p μ ) = p μ x ˙ μ L ( x μ , x ˙ μ ) (40.107) H x μ , p μ = p μ x ˙ μ L x μ , x ˙ μ {:(40.107)H(x^(mu),p_(mu))=p_(mu)x^(˙)^(mu)-L(x^(mu),x^(˙)^(mu)):}\begin{equation*} H\left(x^{\mu}, p_{\mu}\right)=p_{\mu} \dot{x}^{\mu}-L\left(x^{\mu}, \dot{x}^{\mu}\right) \tag{40.107} \end{equation*}(40.107)H(xμ,pμ)=pμx˙μL(xμ,x˙μ)
(b) Show that the Hamiltonian derived from our example Lagrangian is
(40.108) H = 1 2 g μ ν p μ p ν (40.108) H = 1 2 g μ ν p μ p ν {:(40.108)H=(1)/(2)g^(mu nu)p_(mu)p_(nu):}\begin{equation*} H=\frac{1}{2} g^{\mu \nu} p_{\mu} p_{\nu} \tag{40.108} \end{equation*}(40.108)H=12gμνpμpν
The Hamiltonian allows access to another route to finding the equations of motion in terms of p μ p μ p_(mu)p_{\mu}pμ, and
x μ x μ x^(mu)x^{\mu}xμ via a pair of first-order derivatives
(40.109) H p μ = x ˙ μ , H x μ = p ˙ μ . (40.109) H p μ = x ˙ μ , H x μ = p ˙ μ . {:(40.109)(del H)/(delp_(mu))=x^(˙)^(mu)","quad(del H)/(delx^(mu))=-p^(˙)_(mu).:}\begin{equation*} \frac{\partial H}{\partial p_{\mu}}=\dot{x}^{\mu}, \quad \frac{\partial H}{\partial x^{\mu}}=-\dot{p}_{\mu} . \tag{40.109} \end{equation*}(40.109)Hpμ=x˙μ,Hxμ=p˙μ.
These are Hamilton's equations.
(c) Use Hamilton's equations to compute the geodesic equation again.
(d) What happens to the equation of motion if a metric coefficient is independent of one of the coordinates?
(40.2) Assuming a Lagrangian of the same form as used in the last question, a particle in a Schwarzschild geometry has
L = 1 2 [ ( 1 2 M r ) t ˙ 2 + ( 1 2 M r ) 1 r ˙ 2 (40.110) + r 2 θ ˙ 2 + r 2 sin 2 θ ϕ ˙ 2 ] L = 1 2 1 2 M r t ˙ 2 + 1 2 M r 1 r ˙ 2 (40.110) + r 2 θ ˙ 2 + r 2 sin 2 θ ϕ ˙ 2 {:[L=(1)/(2)[-(1-(2M)/(r))t^(˙)^(2)+(1-(2M)/(r))^(-1)r^(˙)^(2):}],[(40.110){:+r^(2)theta^(˙)^(2)+r^(2)sin^(2)thetaphi^(˙)^(2)]]:}\begin{align*} L= & \frac{1}{2}\left[-\left(1-\frac{2 M}{r}\right) \dot{t}^{2}+\left(1-\frac{2 M}{r}\right)^{-1} \dot{r}^{2}\right. \\ & \left.+r^{2} \dot{\theta}^{2}+r^{2} \sin ^{2} \theta \dot{\phi}^{2}\right] \tag{40.110} \end{align*}L=12[(12Mr)t˙2+(12Mr)1r˙2(40.110)+r2θ˙2+r2sin2θϕ˙2]
(a) Compute the momenta and show the Hamiltonian H = L H = L H=LH=LH=L.
(b) Use Hamilton's equations to show that p t p t p_(t)p_{t}pt and p ϕ p ϕ p_(phi)p_{\phi}pϕ are constants of the motion.
(40.3) Consider the Lagrangian density
(40.111) L = 1 2 ϕ ˙ 2 1 2 ( ϕ ) 2 1 2 m 2 ϕ 2 (40.111) L = 1 2 ϕ ˙ 2 1 2 ( ϕ ) 2 1 2 m 2 ϕ 2 {:(40.111)L=(1)/(2)*phi^(˙)^(2)-(1)/(2)( vec(grad)phi)^(2)-(1)/(2)m^(2)phi^(2):}\begin{equation*} \mathcal{L}=\frac{1}{2} \cdot \dot{\phi}^{2}-\frac{1}{2}(\vec{\nabla} \phi)^{2}-\frac{1}{2} m^{2} \phi^{2} \tag{40.111} \end{equation*}(40.111)L=12ϕ˙212(ϕ)212m2ϕ2
restricted to Minkowski space.
By analogy with the state of affairs in the last question, we define the momentum density as
(40.112) π = L ϕ ˙ (40.112) π = L ϕ ˙ {:(40.112)pi=(delL)/(del(phi^(˙))):}\begin{equation*} \pi=\frac{\partial \mathcal{L}}{\partial \dot{\phi}} \tag{40.112} \end{equation*}(40.112)π=Lϕ˙
(a) Show that the momentum density for our Lagrangian is given by
(40.113) π = ϕ ˙ (40.113) π = ϕ ˙ {:(40.113)pi=phi^(˙):}\begin{equation*} \pi=\dot{\phi} \tag{40.113} \end{equation*}(40.113)π=ϕ˙
Again, by analogy, we define the Hamiltonian density as
(40.114) H = π ϕ ˙ L (40.114) H = π ϕ ˙ L {:(40.114)H=piphi^(˙)-L:}\begin{equation*} \mathcal{H}=\pi \dot{\phi}-\mathcal{L} \tag{40.114} \end{equation*}(40.114)H=πϕ˙L
(b) Show that, for our example, we find
(40.115) H = 1 2 ϕ ˙ 2 + 1 2 ( ϕ ) 2 + 1 2 m 2 ϕ 2 (40.115) H = 1 2 ϕ ˙ 2 + 1 2 ( ϕ ) 2 + 1 2 m 2 ϕ 2 {:(40.115)H=(1)/(2)phi^(˙)^(2)+(1)/(2)( vec(grad)phi)^(2)+(1)/(2)m^(2)phi^(2):}\begin{equation*} \mathcal{H}=\frac{1}{2} \dot{\phi}^{2}+\frac{1}{2}(\vec{\nabla} \phi)^{2}+\frac{1}{2} m^{2} \phi^{2} \tag{40.115} \end{equation*}(40.115)H=12ϕ˙2+12(ϕ)2+12m2ϕ2
(c) We also define the momentum-like quantities
(40.116) Π μ = L ( μ ϕ ) (40.116) Π μ = L μ ϕ {:(40.116)Pi^(mu)=(delL)/(del(del_(mu)phi)):}\begin{equation*} \Pi^{\mu}=\frac{\partial \mathcal{L}}{\partial\left(\partial_{\mu} \phi\right)} \tag{40.116} \end{equation*}(40.116)Πμ=L(μϕ)
Show that Π μ = μ ϕ Π μ = μ ϕ Pi^(mu)=del^(mu)phi\Pi^{\mu}=\partial^{\mu} \phiΠμ=μϕ and Π 0 = π = ϕ ˙ Π 0 = π = ϕ ˙ Pi^(0)=pi=phi^(˙)\Pi^{0}=\pi=\dot{\phi}Π0=π=ϕ˙.
(40.4) The symmetry of T T T\boldsymbol{T}T is necessary for angular momentum to be conserved. We define an antisymmetric four-dimensional angular-momentum tensor for a particle of J i k = x i p k x k p i J i k = x i p k x k p i J^(ik)=x^(i)p^(k)-x^(k)p^(i)J^{i k}=x^{i} p^{k}-x^{k} p^{i}Jik=xipkxkpi. In flat space and Cartesian coordinates, the continuous version is then
(40.117) J i k = d 3 x ( x i T k 0 x k T i 0 ) (40.117) J i k = d 3 x x i T k 0 x k T i 0 {:(40.117)J^(ik)=intd^(3)x(x^(i)T^(k0)-x^(k)T^(i0)):}\begin{equation*} J^{i k}=\int \mathrm{d}^{3} x\left(x^{i} T^{k 0}-x^{k} T^{i 0}\right) \tag{40.117} \end{equation*}(40.117)Jik=d3x(xiTk0xkTi0)
We then define a tensor with components M μ ν σ = M μ ν σ = M^(mu nu sigma)=M^{\mu \nu \sigma}=Mμνσ= x μ T ν σ x ν T μ σ x μ T ν σ x ν T μ σ x^(mu)T^(nu sigma)-x^(nu)T^(mu sigma)x^{\mu} T^{\nu \sigma}-x^{\nu} T^{\mu \sigma}xμTνσxνTμσ. Conservation of angular momentum in flat ( 3 + 1 ) ( 3 + 1 ) (3+1)(3+1)(3+1)-dimensional spacetime is then expressed via
(40.118) M μ ν σ x σ = x σ ( x μ T ν σ x ν T μ σ ) = 0 (40.118) M μ ν σ x σ = x σ x μ T ν σ x ν T μ σ = 0 {:(40.118)(delM^(mu nu sigma))/(delx^(sigma))=(del)/(delx^(sigma))(x^(mu)T^(nu sigma)-x^(nu)T^(mu sigma))=0:}\begin{equation*} \frac{\partial M^{\mu \nu \sigma}}{\partial x^{\sigma}}=\frac{\partial}{\partial x^{\sigma}}\left(x^{\mu} T^{\nu \sigma}-x^{\nu} T^{\mu \sigma}\right)=0 \tag{40.118} \end{equation*}(40.118)Mμνσxσ=xσ(xμTνσxνTμσ)=0
Show that for this to be true, T T T\boldsymbol{T}T must be symmetric.
(40.5) Show that eqn 40.88 is consistent with local mass conservation, as described in eqn 40.86 .

Inflation

Before she had drunk half the bottle, she found her head pressing against the ceiling, and had to stoop to save her neck from being broken. She hastily put down the bottle, remarking, 'That's quite enough - I hope I sha'n't grow any more.' Lewis Carroll (1832-1898) Alice's Adventures in Wonderland
Our story so far is that the Universe can be described, to a good approximation, as a Robertson-Walker spacetime that started in an initial Big-Bang singularity. However, there are some problems with an explanation of the current state of our Universe using this standard model of cosmology. We mentioned in Chapter 18 that observations suggest the Universe appears very close to being a spatially flat, or k = 0 k = 0 k=0k=0k=0, spacetime. This can be expressed using the form of the Friedmann equation that reads
(41.1) k a ˙ 2 = Ω 1 . (41.1) k a ˙ 2 = Ω 1 . {:(41.1)(k)/(a^(˙)^(2))=Omega-1.:}\begin{equation*} \frac{k}{\dot{a}^{2}}=\Omega-1 . \tag{41.1} \end{equation*}(41.1)ka˙2=Ω1.
The implication is either that k = 0 k = 0 k=0k=0k=0 for all time, or that the current density ratio Ω = ρ / ρ c 1 Ω = ρ / ρ c 1 Omega=rho//rho_(c)~~1\Omega=\rho / \rho_{c} \approx 1Ω=ρ/ρc1. Our observations provide an estimate of Ω 0.3 Ω 0.3 Omega~~0.3\Omega \approx 0.3Ω0.3, which isn't unity, but isn't massively far away either. In the case that k = 0 k = 0 k=0k=0k=0, we might ask how and why the Universe was set up to be flat. If Ω 1 Ω 1 Omega~~1\Omega \approx 1Ω1 now, but was not so in the past, this would seems quite a coincidence, given the need for the Universe to be very finely tuned in the past to conspire to give this result only now. 1 1 ^(1){ }^{1}1 So what is the reason that we find Ω 1 Ω 1 Omega~~1\Omega \approx 1Ω1 ? This collection of questions is known as the flatness problem.
A further problem is that the Universe is also observed to be surprisingly homogeneous at earlier times. 2 2 ^(2){ }^{2}2 This is especially surprising in the context of a hot Big Bang model in which the matter in our Universe cools very rapidly as it expands.

Example 41.1

A rough picture of the earliest stages of the Universe can be summarized as follows Evolution occurs via several phase transitions, the phenomena that form the basis of this chapter
  • At a time t 10 43 s t 10 43 s t~~10^(-43)st \approx 10^{-43} \mathrm{~s}t1043 s after the Big Bang, the temperature scale of the Universe is T 10 32 K T 10 32 K T~~10^(32)KT \approx 10^{32} \mathrm{~K}T1032 K and the corresponding energy scale is E 10 19 GeV E 10 19 GeV E~~10^(19)GeVE \approx 10^{19} \mathrm{GeV}E1019GeV. Under these condition, the strong and electroweak interactions are unified into a single interaction. On cooling below this scale (at time t 10 36 s t 10 36 s t~~10^(-36)st \approx 10^{-36} \mathrm{~s}t1036 s, energy 10 15 GeV 10 15 GeV 10^(15)GeV10^{15} \mathrm{GeV}1015GeV or temperature 10 27 K 10 27 K 10^(27)K10^{27} \mathrm{~K}1027 K ) a phase transition 3 3 ^(3){ }^{3}3 takes place causing the strong and electroweak interactions to separate.

41

41.1 Symmetry breaking 446 41.2 Effective potentials 449 41.3 Why flat? 449 41.3 Why flat? 451 Chapter summary 452 Exercises 452
1 1 ^(1){ }^{1}1 The implications of the apparent finely tuned nature of the Universe have generated a lot of debate. For more on this issue, see the book A Fortunate Universe by Lewis and Barnes (listed in the Further Reading in Appendix A) or the article on this subject in the Stanford Encyclopedia of Philosophy (https://plato.stanford.edu/).
2 2 ^(2){ }^{2}2 The Cosmic Background Explorer (COBE) satellite showed that the cosmic microwave background (CMB) has a near-perfect black-body spectrum with average temperature T = 2.73 K T = 2.73 K T=2.73KT=2.73 \mathrm{~K}T=2.73 K, with only faint anisotropies at a level of 1 part in 10 5 10 5 10^(5)10^{5}105.
3 3 ^(3){ }^{3}3 This is known as the 'GUT phase transition', referencing putative Grand Unified (GUT) theories, that describe the combination of the strong and electroweak interactions.
4 4 ^(4){ }^{4}4 By 'not causally connected' we mean that photons from one region have not had time to travel to another region, much less establish thermal equilibrium.
Fig. 41.1 (a) Penrose diagram show ing the horizon problem. The dotted line represents a phase transition (P.T.) and B.B. labels the Big-Bang singularity. Events q q qqq and r r rrr do not have any shared history, and neither do events s s sss and t t ttt that take place at the instant of the phase transition. (b) Inflation pushes back the Big Bang, so that the pasts of events q q qqq and r r rrr overlap for a period after the Big Bang. Events s s sss and t t ttt that take place at the phase transition, then also have a shared history.
5 5 ^(5){ }^{5}5 Inflation was originally suggested by Alan Guth (1947- ) who examined the case of a scalar field in a spatially flat Universe as we do in this chapter. 6 6 ^(6){ }^{6}6 See Penrose (2004) for a discussion.
7 7 ^(7){ }^{7}7 L. D. Landau (1908-1968), perhaps best known for foundational work in condensed matter physics, also made contributions to general relativity, including the independent computation of the Chandrasekhar limit (the maxof the Chandrasekhar limit (the max-
imum mass of a stable white dwarf) in 1932.
Fig. 41.2 The magnet at T > T c T > T c T > T_(c)T>T_{\mathrm{c}}T>Tc. The magnetization, or average moment, is zero in (a) and, after rotation of each spin through 180 180 180^(@)180^{\circ}180, is still zero in (b).
  • At t 10 11 s ( t 10 11 s t~~10^(-11)s(:}t \approx 10^{-11} \mathrm{~s}\left(\right.t1011 s( or T 10 15 K , E 10 2 GeV ) T 10 15 K , E 10 2 GeV {:T~~10^(15)(K),E~~10^(2)GeV)\left.T \approx 10^{15} \mathrm{~K}, E \approx 10^{2} \mathrm{GeV}\right)T1015 K,E102GeV) another phase transition occurs causing the Higgs scalar field to take on a finite value. The result is that the electromagnetic and weak interactions separate. The Higgs mechanism causes some of the leptons to become massive. This is known as the electroweak symmetry-breaking transition.
  • After t 10 5 s t 10 5 s t~~10^(-5)st \approx 10^{-5} \mathrm{~s}t105 s the Universe is cool enough ( T 10 12 K , E 100 MeV T 10 12 K , E 100 MeV T~~10^(12)K,E~~100MeVT \approx 10^{12} \mathrm{~K}, E \approx 100 \mathrm{MeV}T1012 K,E100MeV ) for a further phase transition to occur. This one allows the strong interaction to increase in strength, leading to quark confinement.
The expansion of the early Universe implies that regions of space that cannot have been causally connected 4 4 ^(4){ }^{4}4 in the early stages of the cooling and expansion of the Universe are now found to be at almost exactly the same temperature. This is known as the horizon problem. Figure 41.1(a) shows a Penrose diagram illustrating the problem. An observer at p p ppp can see two events ( q ( q (q(q(q and r r rrr ) by looking in different directions. These events are not causally connected, in that they don't have histories that overlap at any period after the Big Bang. It is therefore a problem if the regions of spacetime around the events are near-identical.
To explain these problems we might suggest that the Universe underwent a period of very rapid expansion in its early history, known as inflation. 5 5 ^(5){ }^{5}5 This involves an acceleration of the scale factor a ( t ) a ( t ) a(t)a(t)a(t) of the Universe. [This is in contrast to most of the results we have examined in which a ( t ) a ( t ) a(t)a(t)a(t) is decelerating.] Inflation allows causally connected regions of spacetime to separate rapidly enough to give the homogeneous temperature distribution that is observed. As illustrated in Fig. 41.1(b), it effectively pushes back the Big Bang, allowing events taking place at a phase transition to have some region of shared history. It is notable that this does not necessarily solve the horizon problem, as although there is some communication in the past, events such as s s sss and t t ttt in Fig. 41.1(b) do not share much of their history, so might well be rather different at the point of the phase transition. 6 6 ^(6){ }^{6}6 Inflation also results in a spatially flat Universe.
Inflation can be understood using the machinery of field theory that we discussed in the last chapter. The key concept is one familiar from thermodynamics that has been invoked in the evolution of the early Universe: the symmetry-breaking phase transition.

41.1 Symmetry breaking

An intuitive and powerful description of a continuous phase transition was developed by Lev Landau. 7 7 ^(7){ }^{7}7 Our discussion of Landau's theory of phase transitions begins with a simple observation about magnets. We imagine a magnet whose spin moments can point either up or down. At high temperatures each spin is equally likely to be found up or down. This system is shown in Fig. 41.2(a). Its magnetization, that is, the spatial average of the magnetic moment, is zero. The system has a symmetry: turn all of the spins through 180 180 180^(@)180^{\circ}180 [as in Fig. 41.2(b)] and the magnetization is still zero. Of course, each individual moment is point-
ing in the opposite direction, but the number pointing in the upwards direction is still half of the total and the magnetization is still zero. The symmetry here is a global one: we rotate all of the spins through the same angle, here 180 180 180^(@)180^{\circ}180.
It is found experimentally that upon cooling the system through a critical temperature T c T c T_(c)T_{c}Tc, the system undergoes a phase transition and the magnetization M M MMM becomes non-zero as all of the spins line up along a single direction, as shown in Fig. 41.3(a). The direction along which the spins align could either be all in the up direction or the down direction. If we rotate each spin of the aligned system through 180 180 180^(@)180^{\circ}180 [Fig. 41.3(b)] M M MMM is obviously reversed. We say that the system has broken (or lowered) its symmetry in the ordered phase.
One puzzling feature of this description is the reason why the system chose to point all of the spins in one direction rather than the other. After all, there's nothing in the Hamiltonian which describes the system that distinguishes between up and down. 8 8 ^(8){ }^{8}8 The result is that the ground state does not have the symmetry of the Hamiltonian describing the system. The original symmetry of the system appears to have spontaneously broken. The same thing happens in the Euler strut shown in Fig. 41.4. A weight is balanced on top of an elastic strut. If the weight is large enough the strut will buckle. The buckling can be either to the left or to the right. There is nothing in the underlying physics of the strut and weight that allows one to predict which way it will go. 9 9 ^(9){ }^{9}9
Since equilibrium in thermodynamic systems relies on both minimizing the internal energy U U UUU and maximizing the entropy S S SSS of a system, Landau considered the free energy F = U T S F = U T S F=U-TSF=U-T SF=UTS. To find the equilibrium state of the system we need to minimize F F FFF. The free energy is a function of an order parameter, namely some field describing the system whose thermal average is zero in the T > T c T > T c T > T_(c)T>T_{\mathrm{c}}T>Tc unbroken-symmetry state and non-zero in the T < T c T < T c T < T_(c)T<T_{\mathrm{c}}T<Tc broken-symmetry state. For a magnet, the order parameter is simply the magnetization field M M MMM. With deliberate ignorance of the microscopic state of the system, Landau wrote F ( M ) F ( M ) F(M)F(M)F(M) as a power series
(41.2) F = F 0 + a M 2 + b M 4 + , (41.2) F = F 0 + a M 2 + b M 4 + , {:(41.2)F=F_(0)+aM^(2)+bM^(4)+dots",":}\begin{equation*} F=F_{0}+a M^{2}+b M^{4}+\ldots, \tag{41.2} \end{equation*}(41.2)F=F0+aM2+bM4+,
where a a aaa and b b bbb (the latter assumed > 0 > 0 > 0>0>0 ) are parameters which are independent of M M MMM, but may in principle depend on the temperature. This free energy has the symmetry M M M M M rarr-MM \rightarrow-MMM, that is, reversing the magnetization doesn't affect the energy. If a a aaa and b b bbb are positive we have the free energy shown in Fig. 41.5(a), which is minimized at M = 0 M = 0 M=0M=0M=0, which is the correct prediction for the high-temperature regime. If, however, the parameter a a aaa is negative then we have the free energy shown in Fig. 41.5(b), which has two minima at non-zero values of magnetization. These minima correspond to the spins all aligning up ( M = + M 0 ) M = + M 0 (M=+M_(0))\left(M=+M_{0}\right)(M=+M0) or all aligning down ( M = M 0 ) M = M 0 (M=-M_(0))\left(M=-M_{0}\right)(M=M0). The previous minimum M = 0 M = 0 M=0M=0M=0 is now at a position of metastable equilibrium. 10 10 ^(10){ }^{10}10 If we take a = a 0 ( T T c ) a = a 0 T T c a=a_(0)(T-T_(c))a=a_{0}\left(T-T_{c}\right)a=a0(TTc) (with a 0 a 0 a_(0)a_{0}a0 a constant) and b ( > 0 ) b ( > 0 ) b( > 0)b(>0)b(>0) to be T T TTT-independent, then clearly a a aaa is positive for T > T c T > T c T > T_(c)T>T_{c}T>Tc and negative for T < T c T < T c T < T_(c)T<T_{c}T<Tc, and so we predict a phase transition at a temperature T = T c T = T c T=T_(c)T=T_{\mathrm{c}}T=Tc.
Fig. 41.3 For temperatures T < T c T < T c T < T_(c)T<T_{\mathrm{c}}T<Tc the spins align along a single direction. The magnetization in (a) is different in (b), where each spin has been rotated by 180 180 180^(@)180^{\circ}180.
8 8 ^(8){ }^{8}8 For the magnet this Hamiltonian could be H ^ = J i S ^ i z S ^ i + 1 z H ^ = J i S ^ i z S ^ i + 1 z hat(H)=-Jsum_(i) hat(S)_(i)^(z) hat(S)_(i+1)^(z)\hat{H}=-J \sum_{i} \hat{S}_{i}^{z} \hat{S}_{i+1}^{z}H^=JiS^izS^i+1z, where J J JJJ is a coupling constant, S ^ i z S ^ i z hat(S)_(i)^(z)\hat{S}_{i}^{z}S^iz is the spin operator for the i i iii th spin and the up direction is along z z zzz. This Hamiltonian rection is along 2 . This either favouring the spins pointing up or favouring them pointing down, only for them pointing in the same direction.


(c)
Fig. 41.4 The Euler strut. (a) This has a vertical axis of symmetry and can buckle either (b) one way or (c) the other, in each case breaking symmetry. 9 9 ^(9){ }^{9}9 Of course, in real life, some very small perturbation or fluctuation will have tipped the system towards its choice of

Fig. 41.5 The Landau free energy for (a) T > T c T > T c T > T_(c)T>T_{\mathrm{c}}T>Tc with a minimum at M = M = M=M=M= 0 and (b) for T < T c T < T c T < T_(c)T<T_{\mathrm{c}}T<Tc with minima at ± M 0 ± M 0 +-M_(0)\pm M_{0}±M0.
10 10 ^(10){ }^{10}10 This means that any slight perturbation of the system which has been prepared in a M = 0 M = 0 M=0M=0M=0 state will drop it into one, or other, of the new minima that have emerged at ± M 0 ± M 0 +-M_(0)\pm M_{0}±M0.
Example 41.2
We can find the minima of F F FFF straightforwardly for T < T c T < T c T < T_(c)T<T_{\mathrm{c}}T<Tc. We set
(41.3) F M = 2 a M + 4 b M 3 = 0 (41.3) F M = 2 a M + 4 b M 3 = 0 {:(41.3)(del F)/(del M)=2aM+4bM^(3)=0:}\begin{equation*} \frac{\partial F}{\partial M}=2 a M+4 b M^{3}=0 \tag{41.3} \end{equation*}(41.3)FM=2aM+4bM3=0
from which we conclude that the minima occur at M = M 0 M = M 0 M=M_(0)M=M_{0}M=M0 where
(41.4) M 0 2 = a 2 b (41.4) M 0 2 = a 2 b {:(41.4)M_(0)^(2)=-(a)/(2b):}\begin{equation*} M_{0}^{2}=-\frac{a}{2 b} \tag{41.4} \end{equation*}(41.4)M02=a2b
The square root gives a real M 0 M 0 M_(0)M_{0}M0 since a < 0 a < 0 a < 0a<0a<0. There is also a maximum at M = 0 M = 0 M=0M=0M=0.
For T < T c T < T c T < T_(c)T<T_{\mathrm{c}}T<Tc we have two minima at M = ± M 0 M = ± M 0 M=+-M_(0)M= \pm M_{0}M=±M0. The system will be propelled into one of these as its new ground state and the symmetry will be broken.
Let's now apply the arguments developed for magnets to our Lagrangian treatment of field theory. Here instead of following the ground state magnetization, we're interested in the equilibrium state value of ϕ ( x ) ϕ ( x ) phi(x)\phi(x)ϕ(x), which is the order parameter for the field theory. We start with a scalar field theory
(41.5) L = 1 2 ( μ ϕ ) 2 U ( ϕ ) , (41.5) L = 1 2 μ ϕ 2 U ( ϕ ) , {:(41.5)L=-(1)/(2)(del_(mu)phi)^(2)-U(phi)",":}\begin{equation*} \mathcal{L}=-\frac{1}{2}\left(\partial_{\mu} \phi\right)^{2}-U(\phi), \tag{41.5} \end{equation*}(41.5)L=12(μϕ)2U(ϕ),
where we've split off all of the potential-energy-like terms and called them the potential energy density U ( ϕ ) U ( ϕ ) U(phi)U(\phi)U(ϕ). For the so-called ϕ 4 ϕ 4 phi^(4)\phi^{4}ϕ4 theory we have that
(41.6) U ( ϕ ) = μ 2 2 ϕ 2 + λ 4 ! ϕ 4 , (41.6) U ( ϕ ) = μ 2 2 ϕ 2 + λ 4 ! ϕ 4 , {:(41.6)U(phi)=(mu^(2))/(2)phi^(2)+(lambda)/(4!)phi^(4)",":}\begin{equation*} U(\phi)=\frac{\mu^{2}}{2} \phi^{2}+\frac{\lambda}{4!} \phi^{4}, \tag{41.6} \end{equation*}(41.6)U(ϕ)=μ22ϕ2+λ4!ϕ4,
where μ μ mu\muμ and λ ( > 0 ) λ ( > 0 ) lambda( > 0)\lambda(>0)λ(>0) are parameters. This function resembles the free energy in our magnet example above. It admits the global symmetry ϕ ( x ) ϕ ( x ) ϕ ( x ) ϕ ( x ) phi(x)rarr-phi(x)\phi(x) \rightarrow-\phi(x)ϕ(x)ϕ(x). Assuming μ 2 μ 2 mu^(2)\mu^{2}μ2 is positive we have a potential with a minimum at ϕ = 0 ϕ = 0 phi=0\phi=0ϕ=0 as shown in Fig. 41.6(a). The minimum corresponds to a ground state, also known as the vacuum, of ϕ ( x ) = 0 ϕ ( x ) = 0 phi(x)=0\phi(x)=0ϕ(x)=0.
What if we swap the sign in front of μ 2 μ 2 mu^(2)\mu^{2}μ2 ? In that case we have U ( ϕ ) = U ( ϕ ) = U(phi)=U(\phi)=U(ϕ)= μ 2 2 ϕ 2 + λ 4 ! ϕ 4 μ 2 2 ϕ 2 + λ 4 ! ϕ 4 -(mu^(2))/(2)*phi^(2)+(lambda)/(4!)phi^(4)-\frac{\mu^{2}}{2} \cdot \phi^{2}+\frac{\lambda}{4!} \phi^{4}μ22ϕ2+λ4!ϕ4, and a potential that looks like Fig. 41.6(b). The minima 11 11 ^(11){ }^{11}11 of the potential now occur at
(41.7) ϕ 0 = ± ( 6 μ 2 λ ) 1 2 (41.7) ϕ 0 = ± 6 μ 2 λ 1 2 {:(41.7)phi_(0)=+-((6mu^(2))/(lambda))^((1)/(2)):}\begin{equation*} \phi_{0}= \pm\left(\frac{6 \mu^{2}}{\lambda}\right)^{\frac{1}{2}} \tag{41.7} \end{equation*}(41.7)ϕ0=±(6μ2λ)12
so we have the choice of two new vacua. Once the temperature has been lowered such that we have the characteristic broken-symmetry potential, we imagine the system lowering its energy by adopting one of the states ϕ 0 ϕ 0 phi_(0)\phi_{0}ϕ0 consistent with one of the minima. This can be thought of as the system rolling down the potential towards a minimum of the potential function U ( ϕ 0 ) = U 0 U ϕ 0 = U 0 U(phi_(0))=U_(0)U\left(\phi_{0}\right)=U_{0}U(ϕ0)=U0.
This mechanism, involving a scalar field that breaks a symmetry, is the one responsible for the Higgs mechanism that leads to the generation of mass in the electroweak part of the Standard Model. Since there is good evidence for the existence of a scalar Higgs field, there is certainly motivation to examine the effect of scalar fields on the expansion of the Universe. This combination of a field and an expanding spacetime will provide a model of inflation.

41.2 Effective potentials

Our model for inflation involves the phase transition of a scalar field similar to that discussed above. 12 12 ^(12){ }^{12}12 Let's consider a scalar field ϕ ( x ) ϕ ( x ) phi(x)\phi(x)ϕ(x) in the Universe before the GUT phase transition. We'll use the ϕ 4 ϕ 4 phi^(4)\phi^{4}ϕ4 Lagrangian
(41.8) L = 1 2 ( μ ϕ ) 2 μ 2 ( T ) 2 ϕ 2 λ 4 ! ϕ 4 , (41.8) L = 1 2 μ ϕ 2 μ 2 ( T ) 2 ϕ 2 λ 4 ! ϕ 4 , {:(41.8)L=-(1)/(2)(del_(mu)phi)^(2)-(mu^(2)(T))/(2)phi^(2)-(lambda)/(4!)phi^(4)",":}\begin{equation*} \mathcal{L}=-\frac{1}{2}\left(\partial_{\mu} \phi\right)^{2}-\frac{\mu^{2}(T)}{2} \phi^{2}-\frac{\lambda}{4!} \phi^{4}, \tag{41.8} \end{equation*}(41.8)L=12(μϕ)2μ2(T)2ϕ2λ4!ϕ4,
so that the potential U = μ 2 ( T ) 2 ϕ 2 + λ 4 ! ϕ 4 U = μ 2 ( T ) 2 ϕ 2 + λ 4 ! ϕ 4 U=(mu^(2)(T))/(2)phi^(2)+(lambda)/(4!)phi^(4)U=\frac{\mu^{2}(T)}{2} \phi^{2}+\frac{\lambda}{4!} \phi^{4}U=μ2(T)2ϕ2+λ4!ϕ4. This field has an equation of motion
(41.9) ϕ ; μ ν g μ ν = U ϕ = μ 2 ( T ) ϕ 2 + λ 3 ! ϕ 3 . (41.9) ϕ ; μ ν g μ ν = U ϕ = μ 2 ( T ) ϕ 2 + λ 3 ! ϕ 3 . {:(41.9)phi_(;mu nu)g^(mu nu)=(del U)/(del phi)=mu^(2)(T)phi^(2)+(lambda)/(3!)phi^(3).:}\begin{equation*} \phi_{; \mu \nu} g^{\mu \nu}=\frac{\partial U}{\partial \phi}=\mu^{2}(T) \phi^{2}+\frac{\lambda}{3!} \phi^{3} . \tag{41.9} \end{equation*}(41.9)ϕ;μνgμν=Uϕ=μ2(T)ϕ2+λ3!ϕ3.
In eqn 41.8, T ( t ) T ( t ) T(t)T(t)T(t) is the time-dependent temperature which causes μ 2 ( T ) μ 2 ( T ) mu^(2)(T)\mu^{2}(T)μ2(T) to change sign as a function of time below a critical temperature T c T c T_(c)T_{\mathrm{c}}Tc. At this point the field ϕ ϕ phi\phiϕ undergoes a phase transition. In the cosmology we're dealing with, we shall assume that the field ϕ ϕ phi\phiϕ is always uniform in space, so that ϕ = 0 ϕ = 0 vec(grad)phi=0\vec{\nabla} \phi=0ϕ=0 at all times. Crucially, ϕ ϕ phi\phiϕ will vary as a function of time as the phase transition takes place.
We shall now show how the expansion of a Robertson-Walker spacetime causes a frictional term in the equation of motion for ϕ ϕ phi\phiϕ, that depends on the rate of expansion of spacetime. The energy-density and pressure of the ϕ ϕ phi\phiϕ field must also have an effect on the evolution of the Universe, determining the rate of expansion. Taken together, the resulting coupled equations of motion can lead to the rapid expansion of the Universe we're looking for. In the example below, we shall simplify the description for that appropriate to a flat Universe. 13 13 ^(13){ }^{13}13

Example 41.3

Consider the scalar field Lagrangian
(41.10) L = 1 2 ( μ ϕ ) 2 U ( ϕ ) , (41.10) L = 1 2 μ ϕ 2 U ( ϕ ) , {:(41.10)L=-(1)/(2)(del_(mu)phi)^(2)-U(phi)",":}\begin{equation*} \mathcal{L}=-\frac{1}{2}\left(\partial_{\mu} \phi\right)^{2}-U(\phi), \tag{41.10} \end{equation*}(41.10)L=12(μϕ)2U(ϕ),
in a spatially flat Robertson-Walker spacetime 14 14 ^(14){ }^{14}14 with diagonal metric g t t = 1 g t t = 1 g^(tt)=-1g^{t t}=-1gtt=1 and g i i = 1 / a ( t ) 2 g i i = 1 / a ( t ) 2 g^(ii)=1//a(t)^(2)g^{i i}=1 / a(t)^{2}gii=1/a(t)2. The equations of motion are
(41.11) ϕ ; μ ν g μ ν = U ( ϕ ) ϕ (41.11) ϕ ; μ ν g μ ν = U ( ϕ ) ϕ {:(41.11)phi_(;mu nu)g^(mu nu)=(del U(phi))/(del phi):}\begin{equation*} \phi_{; \mu \nu} g^{\mu \nu}=\frac{\partial U(\phi)}{\partial \phi} \tag{41.11} \end{equation*}(41.11)ϕ;μνgμν=U(ϕ)ϕ
and since ϕ ϕ phi\phiϕ is a scalar we have ϕ ; μ = μ ϕ ϕ ; μ = μ ϕ phi_(;mu)=del_(mu)phi\phi_{; \mu}=\partial_{\mu} \phiϕ;μ=μϕ. We can then compute the second covariant derivative as
(41.12) ϕ ; μ ν = ( μ ϕ ) ; ν = ν μ ϕ Γ ν μ σ σ ϕ . (41.12) ϕ ; μ ν = μ ϕ ; ν = ν μ ϕ Γ ν μ σ σ ϕ . {:(41.12)phi_(;mu nu)=(del_(mu)phi)_(;nu)=del_(nu)del_(mu)phi-Gamma_(nu mu)^(sigma)del_(sigma)phi.:}\begin{equation*} \phi_{; \mu \nu}=\left(\partial_{\mu} \phi\right)_{; \nu}=\partial_{\nu} \partial_{\mu} \phi-\Gamma_{\nu \mu}^{\sigma} \partial_{\sigma} \phi . \tag{41.12} \end{equation*}(41.12)ϕ;μν=(μϕ);ν=νμϕΓνμσσϕ.
Since we shall trace over the tensor ϕ ; μ ν ϕ ; μ ν phi_(;mu nu)\phi_{; \mu \nu}ϕ;μν, we need only compute the diagonal elements, and so we find
(41.13) ϕ ; 00 = 2 ϕ t 2 , ϕ ; i i = i 2 ϕ a ˙ a ϕ t , (41.13) ϕ ; 00 = 2 ϕ t 2 , ϕ ; i i = i 2 ϕ a ˙ a ϕ t , {:(41.13)phi_(;00)=(del^(2)phi)/(delt^(2))","quadphi_(;ii)=del_(i)^(2)phi-a^(˙)a(del phi)/(del t)",":}\begin{equation*} \phi_{; 00}=\frac{\partial^{2} \phi}{\partial t^{2}}, \quad \phi_{; i i}=\partial_{i}^{2} \phi-\dot{a} a \frac{\partial \phi}{\partial t}, \tag{41.13} \end{equation*}(41.13)ϕ;00=2ϕt2,ϕ;ii=i2ϕa˙aϕt,
and since we only deal with slow variations in space, we retain the time derivatives only. Tracing using our metric g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν we obtain
(41.14) 2 ϕ t 2 + 3 1 a a t ϕ t + U ϕ = 0 (41.14) 2 ϕ t 2 + 3 1 a a t ϕ t + U ϕ = 0 {:(41.14)(del^(2)phi)/(delt^(2))+3(1)/(a)(del a)/(del t)(del phi)/(del t)+(del U)/(del phi)=0:}\begin{equation*} \frac{\partial^{2} \phi}{\partial t^{2}}+3 \frac{1}{a} \frac{\partial a}{\partial t} \frac{\partial \phi}{\partial t}+\frac{\partial U}{\partial \phi}=0 \tag{41.14} \end{equation*}(41.14)2ϕt2+31aatϕt+Uϕ=0
12 12 ^(12){ }^{12}12 This field should not be identified with the Higgs field, but is rather some other field in the Universe. Confusingly, it is sometime called a Higgs field. has Γ 0 i j = a a ˙ δ i j Γ 0 i j = a a ˙ δ i j Gamma^(0)_(ij)=aa^(˙)delta_(ij)\Gamma^{0}{ }_{i j}=a \dot{a} \delta_{i j}Γ0ij=aa˙δij and Γ j 0 i = ( a ˙ / a ) δ j i Γ j 0 i = ( a ˙ / a ) δ j i Gamma_(j0)^(i)=(a^(˙)//a)delta_(j)^(i)\Gamma_{j 0}^{i}=(\dot{a} / a) \delta_{j}^{i}Γj0i=(a˙/a)δji.
The second term of the latter equation supplies an effective drag factor 3 H = 3 a ˙ α ˙ 3 H = 3 a ˙ α ˙ 3H=3((a^(˙)))/((alpha^(˙)))3 H=3 \frac{\dot{a}}{\dot{\alpha}}3H=3a˙α˙ that damps the motion of the ϕ ϕ phi\phiϕ field. Our theory has an energy-momentum tensor given by
(41.15) T μ ν = μ ϕ ν ϕ g μ ν [ 1 2 g α β α ϕ β ϕ + U ( ϕ ) ] (41.15) T μ ν = μ ϕ ν ϕ g μ ν 1 2 g α β α ϕ β ϕ + U ( ϕ ) {:(41.15)T_(mu nu)=del_(mu)phidel_(nu)phi-g_(mu nu)[(1)/(2)g^(alpha beta)del_(alpha)phidel_(beta)phi+U(phi)]:}\begin{equation*} T_{\mu \nu}=\partial_{\mu} \phi \partial_{\nu} \phi-g_{\mu \nu}\left[\frac{1}{2} g^{\alpha \beta} \partial_{\alpha} \phi \partial_{\beta} \phi+U(\phi)\right] \tag{41.15} \end{equation*}(41.15)Tμν=μϕνϕgμν[12gαβαϕβϕ+U(ϕ)]
Again assuming spatially uniform ϕ ϕ phi\phiϕ, and working in the local frame of an observer, we can pick out an energy-density
(41.16) ρ = 1 2 ϕ ˙ 2 + U ( ϕ ) , (41.16) ρ = 1 2 ϕ ˙ 2 + U ( ϕ ) , {:(41.16)rho=(1)/(2)*phi^(˙)^(2)+U(phi)",":}\begin{equation*} \rho=\frac{1}{2} \cdot \dot{\phi}^{2}+U(\phi), \tag{41.16} \end{equation*}(41.16)ρ=12ϕ˙2+U(ϕ),
and also a pressure
(41.17) p = 1 2 ϕ ˙ 2 U ( ϕ ) . (41.17) p = 1 2 ϕ ˙ 2 U ( ϕ ) . {:(41.17)p=(1)/(2)*phi^(˙)^(2)-U(phi).:}\begin{equation*} p=\frac{1}{2} \cdot \dot{\phi}^{2}-U(\phi) . \tag{41.17} \end{equation*}(41.17)p=12ϕ˙2U(ϕ).
In the event that the fields are constant in time, we have ρ = p ρ = p rho=-p\rho=-pρ=p, which, comparing to the case of a fluid 15 15 ^(15){ }^{15}15 gives us an effective cosmological constant Λ = 8 π U ( ϕ ) Λ = 8 π U ( ϕ ) Lambda=8pi U(phi)\Lambda=8 \pi U(\phi)Λ=8πU(ϕ). This is promising, since it implies that the non-zero scalar field can potentially cause the sort of rapid expansion we saw in the de Sitter spacetime of Universe 1 in Chapter 15. We also need to find the equation of motion for a ( t ) a ( t ) a(t)a(t)a(t). The Friedmann equation for k = 0 k = 0 k=0k=0k=0 tells us that
(41.19) ( a ˙ a ) 2 = 8 π ρ 3 (41.19) a ˙ a 2 = 8 π ρ 3 {:(41.19)(((a^(˙)))/(a))^(2)=(8pi rho)/(3):}\begin{equation*} \left(\frac{\dot{a}}{a}\right)^{2}=\frac{8 \pi \rho}{3} \tag{41.19} \end{equation*}(41.19)(a˙a)2=8πρ3
or, using our result for ρ ρ rho\rhoρ for the scalar field
(41.20) ( a ˙ a ) 2 = 8 π 3 [ 1 2 ϕ ˙ 2 + U ( ϕ ) ] . (41.20) a ˙ a 2 = 8 π 3 1 2 ϕ ˙ 2 + U ( ϕ ) . {:(41.20)(((a^(˙)))/(a))^(2)=(8pi)/(3)*[(1)/(2)*phi^(˙)^(2)+U(phi)].:}\begin{equation*} \left(\frac{\dot{a}}{a}\right)^{2}=\frac{8 \pi}{3} \cdot\left[\frac{1}{2} \cdot \dot{\phi}^{2}+U(\phi)\right] . \tag{41.20} \end{equation*}(41.20)(a˙a)2=8π3[12ϕ˙2+U(ϕ)].
This completes the necessary ingredients to describe inflation.
The two most important equations from our analysis of the scalar field in an expanding universe are then: (i) the damped equation of motion for the ϕ ϕ phi\phiϕ field
(41.21) ϕ ¨ + 3 a ˙ a ϕ ˙ + U ϕ = 0 (41.21) ϕ ¨ + 3 a ˙ a ϕ ˙ + U ϕ = 0 {:(41.21)phi^(¨)+3((a^(˙)))/(a)phi^(˙)+(del U)/(del phi)=0:}\begin{equation*} \ddot{\phi}+3 \frac{\dot{a}}{a} \dot{\phi}+\frac{\partial U}{\partial \phi}=0 \tag{41.21} \end{equation*}(41.21)ϕ¨+3a˙aϕ˙+Uϕ=0
and (ii) the equation of motion for the expansion factor of a Universe filled with ϕ ϕ phi\phiϕ-field energy density
(41.22) ( a ˙ a ) 2 = 8 π 3 ( 1 2 ϕ ˙ + U ) (41.22) a ˙ a 2 = 8 π 3 1 2 ϕ ˙ + U {:(41.22)(((a^(˙)))/(a))^(2)=(8pi)/(3)((1)/(2)(phi^(˙))+U):}\begin{equation*} \left(\frac{\dot{a}}{a}\right)^{2}=\frac{8 \pi}{3}\left(\frac{1}{2} \dot{\phi}+U\right) \tag{41.22} \end{equation*}(41.22)(a˙a)2=8π3(12ϕ˙+U)
The key now is to look at the phase transition in the ϕ ϕ phi\phiϕ field and how ϕ ϕ phi\phiϕ makes its way to its new minimum as a function of time for T < T c T < T c T < T_(c)T<T_{\mathrm{c}}T<Tc. To make the description simple we make the slow-rolling approximation, that at temperatures below the transition, the ϕ ϕ phi\phiϕ field will change slowly in time. This requires a shallow potential U ( ϕ ) U ( ϕ ) U(phi)U(\phi)U(ϕ), as shown in Fig. 41.7.
Slow rolling means that we have a negligible value of ϕ ¨ ϕ ¨ phi^(¨)\ddot{\phi}ϕ¨ so that the equation of motion for the ϕ ϕ phi\phiϕ field (eqn 41.21 ) becomes 3 ( a ˙ / a ) ϕ ˙ = 3 ( a ˙ / a ) ϕ ˙ = 3(a^(˙)//a)phi^(˙)=3(\dot{a} / a) \dot{\phi}=3(a˙/a)ϕ˙= U / ϕ U / ϕ -del U//del phi-\partial U / \partial \phiU/ϕ. Near the transition the flat potential takes a near-constant value U ( ϕ ) U ( ϕ 0 ) = U 0 U ( ϕ ) U ϕ 0 = U 0 U(phi)~~U(phi_(0))=U_(0)U(\phi) \approx U\left(\phi_{0}\right)=U_{0}U(ϕ)U(ϕ0)=U0, which implies that ϕ ˙ ϕ ˙ phi^(˙)\dot{\phi}ϕ˙ also becomes small. What we're really interested in is the equation of motion for the expansion factor of the Universe a ( t ) a ( t ) a(t)a(t)a(t), and so, considering eqn 41.22 with small ϕ ˙ ϕ ˙ phi^(˙)\dot{\phi}ϕ˙, we deduce that
(41.23) a ˙ a ( 8 π U 0 3 ) 1 2 (41.23) a ˙ a 8 π U 0 3 1 2 {:(41.23)((a^(˙)))/(a)~~((8piU_(0))/(3))^((1)/(2)):}\begin{equation*} \frac{\dot{a}}{a} \approx\left(\frac{8 \pi U_{0}}{3}\right)^{\frac{1}{2}} \tag{41.23} \end{equation*}(41.23)a˙a(8πU03)12
The solution to this latter equation is an exponential growth of a ( t ) a ( t ) a(t)a(t)a(t) with
(41.24) a ( t ) exp [ ( 8 π U 0 3 ) 1 2 t ] (41.24) a ( t ) exp 8 π U 0 3 1 2 t {:(41.24)a(t)prop exp[((8piU_(0))/(3))^((1)/(2))t]:}\begin{equation*} a(t) \propto \exp \left[\left(\frac{8 \pi U_{0}}{3}\right)^{\frac{1}{2}} t\right] \tag{41.24} \end{equation*}(41.24)a(t)exp[(8πU03)12t]
The theory therefore predicts an inflationary period where, owing to the behaviour of the ϕ ϕ phi\phiϕ field close to its phase transition and the consequences on its energy density, the expansion factor a ( t ) a ( t ) a(t)a(t)a(t) of the Universe increases exponentially.

41.3 Why flat?

In the above discussion, we have assumed the starting point of inflation was a k = 0 k = 0 k=0k=0k=0 universe. 16 16 ^(16){ }^{16}16 In this section, we shall see that the result of inflation is a flat Universe and, in fact, the end result of an inflationary expansion is always a flat space, no matter which curvature we start with. To understand this we first note that rapid expansion is always seen in models with a cosmological constant Λ Λ Lambda\LambdaΛ and we can therefore compare our exponential expansion to these other analogous universes with cosmological-constant-driven expansion. These models predict expansions
(41.25) a ( t ) { cosh H t ( k = + 1 ) , e H t ( k = 0 ) , sinh H t ( k = 1 ) , (41.25) a ( t ) cosh H t ( k = + 1 ) , e H t ( k = 0 ) , sinh H t ( k = 1 ) , {:(41.25)a(t)prop{[cosh Ht,(k=+1)","],[e^(Ht),(k=0)","],[sinh Ht,(k=-1)","]:}:}a(t) \propto \begin{cases}\cosh H t & (k=+1), \tag{41.25}\\ \mathrm{e}^{H t} & (k=0), \\ \sinh H t & (k=-1),\end{cases}(41.25)a(t){coshHt(k=+1),eHt(k=0),sinhHt(k=1),
with H 2 = Λ / 3 H 2 = Λ / 3 H^(2)=Lambda//3H^{2}=\Lambda / 3H2=Λ/3. Owing to the behaviour of the hyperbolic functions, each of these cases evolve towards an exponential expansion. That is, they are eventually all consistent with the Ω = 1 Ω = 1 Omega=1\Omega=1Ω=1 case that results in k = 0 k = 0 k=0k=0k=0 flat space. So, even if Ω Ω Omega\OmegaΩ is not initially set close to 1 , then exponential expansion forces Ω Ω Omega\OmegaΩ towards this value very rapidly.
The Friedmann equations tell us that in the absence of other sources of cosmological constant Λ Λ Lambda\LambdaΛ, the acceleration of the Universe obeys
(41.26) a ¨ a = 4 π ( p + ρ 3 ) , (41.26) a ¨ a = 4 π p + ρ 3 , {:(41.26)((a^(¨)))/(a)=-4pi(p+(rho)/(3))",":}\begin{equation*} \frac{\ddot{a}}{a}=-4 \pi\left(p+\frac{\rho}{3}\right), \tag{41.26} \end{equation*}(41.26)a¨a=4π(p+ρ3),
so in order to achieve the positive acceleration needed for inflation we require p < ρ / 3 p < ρ / 3 p < -rho//3p<-\rho / 3p<ρ/3. In terms of the estimates for p p ppp and ρ ρ rho\rhoρ from our scalar field (eqns 41.16 and 41.17), this condition is equivalent to
(41.27) ϕ ˙ 2 < U ( ϕ ) . (41.27) ϕ ˙ 2 < U ( ϕ ) . {:(41.27)phi^(˙)^(2) < U(phi).:}\begin{equation*} \dot{\phi}^{2}<U(\phi) . \tag{41.27} \end{equation*}(41.27)ϕ˙2<U(ϕ).
The condition for expansion provides a means for inflation to cease during the rolling of the field ϕ ϕ phi\phiϕ along the potential: that is, when the speed becomes large enough and/or the value of U ( ϕ ) U ( ϕ ) U(phi)U(\phi)U(ϕ) is sufficiently small. One possibility is that ϕ ϕ phi\phiϕ reach the minimum at U 0 U 0 U_(0)U_{0}U0 and then oscillates about this value as the damping term reduces the kinetic energy 1 2 ϕ ˙ 2 1 2 ϕ ˙ 2 (1)/(2)phi^(˙)^(2)\frac{1}{2} \dot{\phi}^{2}12ϕ˙2 until the field is at rest. If U 0 < 0 U 0 < 0 U_(0) < 0U_{0}<0U0<0 at the minimum then inflation stops. 17 17 ^(17){ }^{17}17 In practice, the presence of coupling to other fields will cause reheating, where
16 16 ^(16){ }^{16}16 Inflation occurs at the start of the expansion of the Universe so the effect of a particular value of k k kkk will be difficult to detect before the Universe has evolved sufficiently. Other values of k k kkk can be treated in the same spirit, but the differences in the predictions but the differences in the predictions
are small compared to the observables. are small compared to the observables.
Treating k = 0 k = 0 k=0k=0k=0 is therefore a good approximation.
17 17 ^(17){ }^{17}17 The condition therefore also provides a means for inflation to continue indefa means for inflation to continue indef-
initely if the field comes to rest at the initely if the field come
minimum and U 0 > 0 U 0 > 0 U_(0) > 0U_{0}>0U0>0.
the scalar field decays during the roll, producing other particle excitations. At the end of the period of inflation we are then left with a flat Universe filled with normal matter and radiation, allowing a standardmodel expansion of the Universe afterwards.
Although the inflationary model has been hugely influential over the last few decades, it is not accepted by absolutely all cosmologists. Is it the correct model of the early Universe? It is probably fair to say that the jury is still out. However, cosmic inflation remains the reigning paradigm. As we've seen in this chapter, it solves some knotty problems in cosmology and it is likely to continue to remain our best picture of the Universe just after the Big Bang, unless of course some other revolutionary new idea or piece of contradictory observational evidence turns up. We discuss the current status of the field in Chapter 49.

Chapter summary

  • To solve the flatness and horizon problems of cosmology, inflation theory proposes a rapid period of expansion in the early life of the Universe.
  • The theory relies on the notion of a symmetry-breaking phase transition that takes place as the Universe cools.
  • The slow-rolling approximation predicts an exponential expansion of the Universe for a limited period of time, leading to a spatially flat Universe at the end of the expansion.

Exercises

(41.1) (a) Compute the equation of motion for a scalar field Φ Φ Phi\PhiΦ described by the Lagrangian in eqn 41.5 in a Robertson-Walker universe with k 0 k 0 k!=0k \neq 0k0.
Use the connection coefficients given in Chapter 16 (b) Verify that the equation of motion reduces to the expected form for k = 0 k = 0 k=0k=0k=0.

The electromagnetic field

Electrical force is defined as something which causes motion of electrical charge; an electrical charge is something which exerts electric force.
Arthur Eddington (1882-1944) The Nature of the Physical World
O'er the wires the electric message came,
'He is no better; he is much the same.'
Alfred Austin (1835-1913) (attrib.) On the Illness of the Prince of Wales
In this chapter, we use the geometrical techniques from the previous part of the book to reformulate electromagnetism. In some ways, electromagnetism can be viewed as one of the simplest and most successful field theories in Nature. Einstein certainly thought so, and would return to electromagnetism regularly for inspiration throughout his formulation of relativity. In much the same way that in Chapter 0, where we characterized gravitation with John Wheeler's slogan
spacetime tells matter how to move; matter tells spacetime how to curve,
we can similarly sloganize electromagnetism by saying
electromagnetic fields tells electric charges how to move; electric charges tells electromagnetic fields where to go.
We shall return to the analogy between electromagnetism and gravitation repeatedly in the coming chapters. This chapter aims to set the scene. We start by working in flat, Minkowski spacetime. 1 1 ^(1){ }^{1}1 We shall produce a Lagrangian description of electromagnetism in terms of the components of its field. We then formulate a description of electromagnetism using the coordinate-free, geometrical machinery of tensors.

42.1 Electric charge in a field

A charge can be thought of as that property of a particle that couples to a field. How could a tensor field cause a charge to move? We have seen the answer for the case of the metric field where curvature causes the movement of massive particles. However, there are other ways that

42.1 Electric charge in a field 453 42.2 Faraday tensor and Maxwell equations 455 42.3 Gauge freedom 458 42.4 Geometrical electromagnetism 460 Chapter summary 464 Exercises 464
2 2 ^(2){ }^{2}2 Recall that the Lagrangian description of fields relies on the components of tensors. We return to our coordinatefree, geometrical description below.
a field can cause a change in motion of a charge. Let's ask what the simplest method of coupling a particle and a field might be. This, it turns out, is how electromagnetism manifests itself in Nature,
The relativistic action for a free massive particle is given by
(42.1) S m = m d s 2 (42.1) S m = m d s 2 {:(42.1)S_(m)=-m intsqrt(-ds^(2)):}\begin{equation*} S_{\mathrm{m}}=-m \int \sqrt{-\mathrm{d} s^{2}} \tag{42.1} \end{equation*}(42.1)Sm=mds2
where in flat, Minkowski space the interval d s 2 = η μ ν d x μ d x ν d s 2 = η μ ν d x μ d x ν ds^(2)=eta_(mu nu)dx^(mu)dx^(nu)\mathrm{d} s^{2}=\eta_{\mu \nu} \mathrm{d} x^{\mu} \mathrm{d} x^{\nu}ds2=ημνdxμdxν. We would like this particle to interact with another field. Perhaps the simplest coupling that obeys Lorentz symmetry couples the displacement of the particle d x μ d x μ dx^(mu)\mathrm{d} x^{\mu}dxμ to the components of a 1 -form field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x). This results in a contribution to the action of 2 2 ^(2){ }^{2}2
(42.2) S em = q A σ ( x ) d x σ , (42.2) S em = q A σ ( x ) d x σ , {:(42.2)S_(em)=q intA_(sigma)(x)dx^(sigma)",":}\begin{equation*} S_{\mathrm{em}}=q \int A_{\sigma}(x) \mathrm{d} x^{\sigma}, \tag{42.2} \end{equation*}(42.2)Sem=qAσ(x)dxσ,
where the charge q q qqq tells us the strength of the coupling between the field and the displacement. This action describes the interaction between the electromagnetic field 3 A ~ ( x ) 3 A ~ ( x ) ^(3) tilde(A)(x){ }^{3} \tilde{\boldsymbol{A}}(x)3A~(x) and a particle with electric charge q q qqq. The particle is assumed massive and so we choose to parametrize its path with an affine parameter τ τ tau\tauτ. We then have a total action S S SSS for a particle in the field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x) given by the sum of the free-particle action and the interaction of the particle and the electromagnetic field 4 S = S m + S em 4 S = S m + S em ^(4)S=S_(m)+S_(em){ }^{4} S=S_{\mathrm{m}}+S_{\mathrm{em}}4S=Sm+Sem or
(42.3) S = [ m ( η μ ν d x μ d τ d x ν d τ ) 1 2 + q A σ d x σ d τ ] d τ (42.3) S = m η μ ν d x μ d τ d x ν d τ 1 2 + q A σ d x σ d τ d τ {:(42.3)S=int[-m(-eta_(mu nu)(dx^(mu))/(dtau)((d)x^(nu))/(dtau))^((1)/(2))+qA_(sigma)(dx^(sigma))/(dtau)]dtau:}\begin{equation*} S=\int\left[-m\left(-\eta_{\mu \nu} \frac{\mathrm{d} x^{\mu}}{\mathrm{d} \tau} \frac{\mathrm{~d} x^{\nu}}{\mathrm{d} \tau}\right)^{\frac{1}{2}}+q A_{\sigma} \frac{\mathrm{d} x^{\sigma}}{\mathrm{d} \tau}\right] \mathrm{d} \tau \tag{42.3} \end{equation*}(42.3)S=[m(ημνdxμdτ dxνdτ)12+qAσdxσdτ]dτ
This action can be used to derive the equations of motion for the particle.

Example 42.1

If the action is given by S = L d τ S = L d τ S=int LdtauS=\int L \mathrm{~d} \tauS=L dτ, then we can write the Lagrangian as L = L = L=L=L= L m + L em L m + L em L_(m)+L_(em)L_{\mathrm{m}}+L_{\mathrm{em}}Lm+Lem. We then utilize the Euler-Lagrange equations for each part in turn. The kinetic energy contribution of the free particle is
L m ( d x α d τ ) = m 2 L m ( η μ ν δ μ α d x ν d τ + η μ ν d x μ d τ δ ν α ) (42.4) = m L m η μ α d x μ d τ . L m d x α d τ = m 2 L m η μ ν δ μ α d x ν d τ + η μ ν d x μ d τ δ ν α (42.4) = m L m η μ α d x μ d τ . {:[(delL_(m))/(del(((d)x^(alpha))/(dtau)))=(m)/(2L_(m))*(eta_(mu nu)delta^(mu)_(alpha)(dx^(nu))/(dtau)*+eta_(mu nu)(dx^(mu))/(dtau)*delta^(nu)_(alpha))],[(42.4)=(m)/(L_(m))eta_(mu alpha)(dx^(mu))/(dtau).]:}\begin{align*} \frac{\partial L_{\mathrm{m}}}{\partial\left(\frac{\mathrm{~d} x^{\alpha}}{\mathrm{d} \tau}\right)} & =\frac{m}{2 L_{\mathrm{m}}} \cdot\left(\eta_{\mu \nu} \delta^{\mu}{ }_{\alpha} \frac{\mathrm{d} x^{\nu}}{\mathrm{d} \tau} \cdot+\eta_{\mu \nu} \frac{\mathrm{d} x^{\mu}}{\mathrm{d} \tau} \cdot \delta^{\nu}{ }_{\alpha}\right) \\ & =\frac{m}{L_{\mathrm{m}}} \eta_{\mu \alpha} \frac{\mathrm{d} x^{\mu}}{\mathrm{d} \tau} . \tag{42.4} \end{align*}Lm( dxαdτ)=m2Lm(ημνδμαdxνdτ+ημνdxμdτδνα)(42.4)=mLmημαdxμdτ.
Using the usual routine for length parametrization we have
(42.5) d d τ L m ( d x α d τ ) = m η μ α d 2 x μ d τ 2 (42.5) d d τ L m d x α d τ = m η μ α d 2 x μ d τ 2 {:(42.5)(d)/((d)tau)*(delL_(m))/(del(((d)x^(alpha))/(dtau)))=meta_(mu alpha)(d^(2)x^(mu))/(dtau^(2)):}\begin{equation*} \frac{\mathrm{d}}{\mathrm{~d} \tau} \cdot \frac{\partial L_{\mathrm{m}}}{\partial\left(\frac{\mathrm{~d} x^{\alpha}}{\mathrm{d} \tau}\right)}=m \eta_{\mu \alpha} \frac{\mathrm{d}^{2} x^{\mu}}{\mathrm{d} \tau^{2}} \tag{42.5} \end{equation*}(42.5)d dτLm( dxαdτ)=mημαd2xμdτ2
Turning to the second, interaction, term
(42.6) L em ( d x α d τ ) = q A σ δ σ α = q A α , (42.6) L em d x α d τ = q A σ δ σ α = q A α , {:(42.6)(delL_(em))/(del(((d)x^(alpha))/(dtau)))=qA_(sigma)delta^(sigma)_(alpha)=qA_(alpha)",":}\begin{equation*} \frac{\partial L_{\mathrm{em}}}{\partial\left(\frac{\mathrm{~d} x^{\alpha}}{\mathrm{d} \tau}\right)}=q A_{\sigma} \delta^{\sigma}{ }_{\alpha}=q A_{\alpha}, \tag{42.6} \end{equation*}(42.6)Lem( dxαdτ)=qAσδσα=qAα,
and so
(42.7) d d τ L em ( d x α d τ ) = q d A α d τ = q d x β d τ A α x β (42.7) d d τ L em d x α d τ = q d A α d τ = q d x β d τ A α x β {:(42.7)(d)/((d)tau)*(delL_(em))/(del(((d)x^(alpha))/(dtau)))=q((d)A_(alpha))/(dtau)=q((d)x^(beta))/(dtau)(delA_(alpha))/(delx^(beta)):}\begin{equation*} \frac{\mathrm{d}}{\mathrm{~d} \tau} \cdot \frac{\partial L_{\mathrm{em}}}{\partial\left(\frac{\mathrm{~d} x^{\alpha}}{\mathrm{d} \tau}\right)}=q \frac{\mathrm{~d} A_{\alpha}}{\mathrm{d} \tau}=q \frac{\mathrm{~d} x^{\beta}}{\mathrm{d} \tau} \frac{\partial A_{\alpha}}{\partial x^{\beta}} \tag{42.7} \end{equation*}(42.7)d dτLem( dxαdτ)=q dAαdτ=q dxβdτAαxβ
Finally,
(42.8) L em x α = q A σ x α d x σ d τ . (42.8) L em x α = q A σ x α d x σ d τ . {:(42.8)(delL_(em))/(delx^(alpha))=q(delA_(sigma))/(delx^(alpha))(dx^(sigma))/(dtau).:}\begin{equation*} \frac{\partial L_{\mathrm{em}}}{\partial x^{\alpha}}=q \frac{\partial A_{\sigma}}{\partial x^{\alpha}} \frac{\mathrm{d} x^{\sigma}}{\mathrm{d} \tau} . \tag{42.8} \end{equation*}(42.8)Lemxα=qAσxαdxσdτ.
Putting the parts together using the Euler-Lagrange equations gives
(42.9) m η μ α d 2 x μ d τ 2 + q d x β d τ A α x β = q A σ x α d x σ d τ (42.10) m η μ α d 2 x μ d τ 2 = q ( A β x α A α x β ) d x β d τ (42.9) m η μ α d 2 x μ d τ 2 + q d x β d τ A α x β = q A σ x α d x σ d τ (42.10) m η μ α d 2 x μ d τ 2 = q A β x α A α x β d x β d τ {:[(42.9)meta_(mu alpha)(d^(2)x^(mu))/(dtau^(2))+q((d)x^(beta))/(dtau)*(delA_(alpha))/(delx^(beta))=q(delA_(sigma))/(delx^(alpha))(dx^(sigma))/(dtau)],[(42.10)meta_(mu alpha)(d^(2)x^(mu))/(dtau^(2))=q((delA_(beta))/(delx^(alpha))-(delA_(alpha))/(delx^(beta)))(dx^(beta))/(dtau)]:}\begin{align*} & m \eta_{\mu \alpha} \frac{\mathrm{d}^{2} x^{\mu}}{\mathrm{d} \tau^{2}}+q \frac{\mathrm{~d} x^{\beta}}{\mathrm{d} \tau} \cdot \frac{\partial A_{\alpha}}{\partial x^{\beta}}=q \frac{\partial A_{\sigma}}{\partial x^{\alpha}} \frac{\mathrm{d} x^{\sigma}}{\mathrm{d} \tau} \tag{42.9}\\ & m \eta_{\mu \alpha} \frac{\mathrm{d}^{2} x^{\mu}}{\mathrm{d} \tau^{2}}=q\left(\frac{\partial A_{\beta}}{\partial x^{\alpha}}-\frac{\partial A_{\alpha}}{\partial x^{\beta}}\right) \frac{\mathrm{d} x^{\beta}}{\mathrm{d} \tau} \tag{42.10} \end{align*}(42.9)mημαd2xμdτ2+q dxβdτAαxβ=qAσxαdxσdτ(42.10)mημαd2xμdτ2=q(AβxαAαxβ)dxβdτ
or
where we've reindexed the terms in the final line.
We can rewrite the final expression from the last example as 5 5 ^(5){ }^{5}5
(42.12) m η μ α d 2 x μ d τ 2 = q F α β u β (42.12) m η μ α d 2 x μ d τ 2 = q F α β u β {:(42.12)meta_(mu alpha)(d^(2)x^(mu))/(dtau^(2))=qF_(alpha beta)u^(beta):}\begin{equation*} m \eta_{\mu \alpha} \frac{\mathrm{d}^{2} x^{\mu}}{\mathrm{d} \tau^{2}}=q F_{\alpha \beta} u^{\beta} \tag{42.12} \end{equation*}(42.12)mημαd2xμdτ2=qFαβuβ
where u β = d x β / d τ u β = d x β / d τ u^(beta)=dx^(beta)//dtauu^{\beta}=\mathrm{d} x^{\beta} / \mathrm{d} \tauuβ=dxβ/dτ are the components of the velocity, and the interaction is encoded in the tensor components
(42.13) F α β = A β , α A α , β (42.13) F α β = A β , α A α , β {:(42.13)F_(alpha beta)=A_(beta,alpha)-A_(alpha,beta):}\begin{equation*} F_{\alpha \beta}=A_{\beta, \alpha}-A_{\alpha, \beta} \tag{42.13} \end{equation*}(42.13)Fαβ=Aβ,αAα,β
In fact, the F α β F α β F_(alpha beta)F_{\alpha \beta}Fαβ are the components of the Faraday tensor. 6 6 ^(6){ }^{6}6 This is a ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2) tensor which can be written in a basis as
(42.14) F ( , ) = F α β d x α d x β (42.14) F ( , ) = F α β d x α d x β {:(42.14)F(",")=F_(alpha beta)dx^(alpha)ox dx^(beta):}\begin{equation*} \boldsymbol{F}(,)=F_{\alpha \beta} \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta} \tag{42.14} \end{equation*}(42.14)F(,)=Fαβdxαdxβ
Note from the definition of the components that the tensor is antisymmetric and so the Faraday tensor can be written as the 2 -form 7 7 ^(7){ }^{7}7
(42.15) F ~ ( , ) = 1 2 F α β d x α d x β (42.15) F ~ ( , ) = 1 2 F α β d x α d x β {:(42.15) tilde(F)(",")=(1)/(2)F_(alpha beta)dx^(alpha)^^dx^(beta):}\begin{equation*} \tilde{\boldsymbol{F}}(,)=\frac{1}{2} F_{\alpha \beta} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \tag{42.15} \end{equation*}(42.15)F~(,)=12Fαβdxαdxβ
We shall see that it is this tensor which gives us access to the electromagnetic fields. We can summarize the equation of motion symbolically as 8 as 8 as^(8)\mathrm{as}^{8}as8
(42.17) d p ~ d τ = q F ~ ( , u ) (42.17) d p ~ d τ = q F ~ ( , u ) {:(42.17)(d( tilde(p)))/((d)tau)=q tilde(F)(","u):}\begin{equation*} \frac{\mathrm{d} \tilde{\boldsymbol{p}}}{\mathrm{~d} \tau}=q \tilde{\boldsymbol{F}}(, \boldsymbol{u}) \tag{42.17} \end{equation*}(42.17)dp~ dτ=qF~(,u)
where the momentum 1 -form features on the left-hand side and the velocity u u u\boldsymbol{u}u has been inserted into the second slot of the tensor F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ (corresponding to contracting against the second index of F α β F α β F_(alpha beta)F_{\alpha \beta}Fαβ in eqn 42.17). Since this expression is a valid tensor expression in flat space, it must also be applicable to curved space and represents the answer to our question of how a charge couples to the electromagnetic field. Next, we turn to the dynamics of the 1 -form field A ~ A ~ tilde(A)\tilde{\boldsymbol{A}}A~ and the 2 -form F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~, as these provide the equations of motion of the electromagnetic fields, also known as Maxwell's equations. 9 9 ^(9){ }^{9}9

42.2 Faraday tensor and Maxwell equations

The field A ~ ( x ) = A μ ( x ) d x μ A ~ ( x ) = A μ ( x ) d x μ tilde(A)(x)=A_(mu)(x)dx^(mu)\tilde{\boldsymbol{A}}(x)=A_{\mu}(x) \boldsymbol{d} x^{\mu}A~(x)=Aμ(x)dxμ is a 1-form field known as the electromagnetic gauge field. The four components A μ = ( A 0 , A i ) A μ = A 0 , A i A_(mu)=(A_(0),A_(i))A_{\mu}=\left(A_{0}, A_{i}\right)Aμ=(A0,Ai) may, in flat
5 5 ^(5){ }^{5}5 As examined in the Exercises, the spatial components of the right-hand side of this equation give the relativistic version of the Lorentz force law
(42.11) F = q ( E + v × B ) (42.11) F = q ( E + v × B ) {:(42.11) vec(F)=q( vec(E)+ vec(v)xx vec(B)):}\begin{equation*} \vec{F}=q(\vec{E}+\vec{v} \times \vec{B}) \tag{42.11} \end{equation*}(42.11)F=q(E+v×B)
6 6 ^(6){ }^{6}6 Michael Faraday (1791-1867).
7 7 ^(7){ }^{7}7 Remember that the factor 1 / 2 1 / 2 1//21 / 21/2 arises because
1 2 F α β d x α d x β = 1 2 ( F α β d x α d x β F α β d x β d x α ) = 1 2 ( F α β d x α d x β F β α d x α d x β ) = F α β d x α d x β , 1 2 F α β d x α d x β = 1 2 F α β d x α d x β F α β d x β d x α = 1 2 F α β d x α d x β F β α d x α d x β = F α β d x α d x β , {:[(1)/(2)F_(alpha beta)dx^(alpha)^^dx^(beta)],[=(1)/(2)(F_(alpha beta)dx^(alpha)ox dx^(beta):}],[{:-F_(alpha beta)dx^(beta)ox dx^(alpha))],[=(1)/(2)(F_(alpha beta)dx^(alpha)ox dx^(beta):}],[{:-F_(beta alpha)dx^(alpha)ox dx^(beta))],[=F_(alpha beta)dx^(alpha)ox dx^(beta)","]:}\begin{aligned} & \frac{1}{2} F_{\alpha \beta} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \\ = & \frac{1}{2}\left(F_{\alpha \beta} \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta}\right. \\ & \left.-F_{\alpha \beta} \boldsymbol{d} x^{\beta} \otimes \boldsymbol{d} x^{\alpha}\right) \\ = & \frac{1}{2}\left(F_{\alpha \beta} \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta}\right. \\ & \left.-F_{\beta \alpha} \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta}\right) \\ = & F_{\alpha \beta} \boldsymbol{d} x^{\alpha} \otimes \boldsymbol{d} x^{\beta}, \end{aligned}12Fαβdxαdxβ=12(FαβdxαdxβFαβdxβdxα)=12(FαβdxαdxβFβαdxαdxβ)=Fαβdxαdxβ,
where the antisymmetry of the components is used to produce the last line. 8 8 ^(8){ }^{8}8 That is, we lower the index with x α = x α = x_(alpha)=x_{\alpha}=xα= η α μ x μ η α μ x μ eta_(alpha mu)x^(mu)\eta_{\alpha \mu} x^{\mu}ηαμxμ and then see that the left-hand side can be written as
m d d τ u ~ m d d τ u ~ m((d))/((d)tau) tilde(u)m \frac{\mathrm{~d}}{\mathrm{~d} \tau} \tilde{\boldsymbol{u}}m d dτu~
where u α = d x α / d τ u α = d x α / d τ u_(alpha)=dx_(alpha)//dtauu_{\alpha}=\mathrm{d} x_{\alpha} / \mathrm{d} \tauuα=dxα/dτ, before noting that p ~ = m u ~ p ~ = m u ~ tilde(p)=m tilde(u)\tilde{\boldsymbol{p}}=m \tilde{\boldsymbol{u}}p~=mu~. Note also that since F ~ ( u , u ) = 0 F ~ ( u , u ) = 0 tilde(F)(u,u)=0\tilde{\boldsymbol{F}}(\boldsymbol{u}, \boldsymbol{u})=0F~(u,u)=0, which follows from the antisymmetry of the Faraday tensor, we have on the left-hand side an expression that effectively says u a = 0 u a = 0 u*a=0\boldsymbol{u} \cdot \boldsymbol{a}=0ua=0, where a a a\boldsymbol{a}a is the acceleration, which is a condition required for our theory (see Chapter 1). Finally, the upgrade to curved space simply swaps commas for semispace simply
colons to give
D p ~ d τ m u u ~ = q F ~ ( , u ) D p ~ d τ m u u ~ = q F ~ ( , u ) (D( tilde(p)))/((d)tau)-=mgrad_(u) tilde(u)=q tilde(F)(,u)\frac{\mathrm{D} \tilde{\boldsymbol{p}}}{\mathrm{~d} \tau} \equiv m \nabla_{u} \tilde{\boldsymbol{u}}=q \tilde{\boldsymbol{F}}(, \boldsymbol{u})Dp~ dτmuu~=qF~(,u)
The equation of motion tells us that the departure from geodesic free fall is given by the combination of the charge, Faraday tensor and particle velocity. 9 9 ^(9){ }^{9}9 James Clerk Maxwell (1831-1879).
10 10 ^(10){ }^{10}10 As usual, A i A i A^(i)A^{i}Ai are the components of the spatial 3 -vector A A vec(A)\vec{A}A.
11 11 ^(11){ }^{11}11 Note that in flat space we can also transform these to the components of a ( 2 , 0 ) ( 2 , 0 ) (2,0)(2,0)(2,0) tensor F = F μ ν e μ e ν = F = F μ ν e μ e ν = F=F^(mu nu)e_(mu)oxe_(nu)=\boldsymbol{F}=F^{\mu \nu} \boldsymbol{e}_{\mu} \otimes \boldsymbol{e}_{\nu}=F=Fμνeμeν= 1 2 1 F μ ν e μ e ν 1 2 1 F μ ν e μ e ν _((1)/(2))^(1)F^(mu nu)e_(mu)^^e_(nu){ }_{\frac{1}{2}}^{1} F^{\mu \nu} \boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\nu}121Fμνeμeν, where
F μ ν = F μ ν = F^(mu nu)=F^{\mu \nu}=Fμν=
(42.21) ( 0 E x E y E z E x 0 B z B y E y B z 0 B x E z B y B x 0 ) (42.21) 0 E x E y E z E x 0 B z B y E y B z 0 B x E z B y B x 0 {:(42.21)([0,E^(x),E^(y),E^(z)],[-E^(x),0,B^(z),-B^(y)],[-E^(y),-B^(z),0,B^(x)],[-E^(z),B^(y),-B^(x),0]):}\left(\begin{array}{cccc} 0 & E^{x} & E^{y} & E^{z} \tag{42.21}\\ -E^{x} & 0 & B^{z} & -B^{y} \\ -E^{y} & -B^{z} & 0 & B^{x} \\ -E^{z} & B^{y} & -B^{x} & 0 \end{array}\right)(42.21)(0ExEyEzEx0BzByEyBz0BxEzByBx0)
Notice how the components of the E E vec(E)\vec{E}E field change their signs, but those of the B B vec(B)\vec{B}B field do not.
12 12 ^(12){ }^{12}12 It is therefore J J JJJ that tells the field lines where to go (and the field lines tell J J J\boldsymbol{J}J how to evolve in spacetime).
13 13 ^(13){ }^{13}13 This is related to the Bianchi identity for gravitation that we have met in Chapter 13. The link will be discussed in detail in Chapter 43, but we shall show how the identity arises geometrically in electromagnetism at the end of this chapter
14 14 ^(14){ }^{14}14 Set α = 1 , β = 2 α = 1 , β = 2 alpha=1,beta=2\alpha=1, \beta=2α=1,β=2 and γ = 3 γ = 3 gamma=3\gamma=3γ=3.
15 15 ^(15){ }^{15}15 Set one of the indices equal to zero and this follows.
space, be mapped with metric η η eta\boldsymbol{\eta}η onto a ( 1,0 ) vector field A ( x ) A ( x ) A(x)\boldsymbol{A}(x)A(x) with components A μ = ( A 0 , A i ) A μ = A 0 , A i A^(mu)=(A^(0),A^(i))A^{\mu}=\left(A^{0}, A^{i}\right)Aμ=(A0,Ai), where A 0 = A 0 A 0 = A 0 A_(0)=-A^(0)A_{0}=-A^{0}A0=A0 and 10 A i = A i 10 A i = A i ^(10)A_(i)=A^(i){ }^{10} A_{i}=A^{i}10Ai=Ai. The timelike component A 0 A 0 A^(0)A^{0}A0 is often called the electrostatic potential V V VVV.
We then define the 3 -vector electric field E E vec(E)\vec{E}E in terms of the components of the gauge field as
(42.18) E = A 0 A t (42.18) E = A 0 A t {:(42.18) vec(E)=- vec(grad)A^(0)-(del( vec(A)))/(del t):}\begin{equation*} \vec{E}=-\vec{\nabla} A^{0}-\frac{\partial \vec{A}}{\partial t} \tag{42.18} \end{equation*}(42.18)E=A0At
and the magnetic field B B vec(B)\vec{B}B as
(42.19) B = × A . (42.19) B = × A . {:(42.19) vec(B)= vec(grad)xx vec(A).:}\begin{equation*} \vec{B}=\vec{\nabla} \times \vec{A} . \tag{42.19} \end{equation*}(42.19)B=×A.
An important property of these equations is that there is some freedom in how the components A μ A μ A_(mu)A_{\mu}Aμ are specified. In fact, if we make the gauge transformation
(42.20) A μ ( x ) A μ ( x ) χ ( x ) x μ (42.20) A μ ( x ) A μ ( x ) χ ( x ) x μ {:(42.20)A_(mu)(x)rarrA_(mu)(x)-(del chi(x))/(delx^(mu)):}\begin{equation*} A_{\mu}(x) \rightarrow A_{\mu}(x)-\frac{\partial \chi(x)}{\partial x^{\mu}} \tag{42.20} \end{equation*}(42.20)Aμ(x)Aμ(x)χ(x)xμ
where χ ( x ) χ ( x ) chi(x)\chi(x)χ(x) is some arbitrary function, then the E E vec(E)\vec{E}E and B B vec(B)\vec{B}B fields are unchanged. This turns out to be of fundamental importance to the theory and is discussed further in the next section. Our task here is to derive the equations of motion of the electromagnetic field.
Since the components of the Faraday tensor are given by F μ ν = A ν , μ F μ ν = A ν , μ F_(mu nu)=A_(nu,mu)-F_{\mu \nu}=A_{\nu, \mu}-Fμν=Aν,μ A μ , ν A μ , ν A_(mu,nu)A_{\mu, \nu}Aμ,ν then, in terms of the components of E E vec(E)\vec{E}E and B B vec(B)\vec{B}B fields, we can write the matrix as 11 11 ^(11){ }^{11}11
(42.22) F μ ν = ( 0 E x E y E z E x 0 B z B y E y B z 0 B x E z B y B x 0 ) . (42.22) F μ ν = 0 E x E y E z E x 0 B z B y E y B z 0 B x E z B y B x 0 . {:(42.22)F_(mu nu)=([0,-E^(x),-E^(y),-E^(z)],[E^(x),0,B^(z),-B^(y)],[E^(y),-B^(z),0,B^(x)],[E^(z),B^(y),-B^(x),0]).:}F_{\mu \nu}=\left(\begin{array}{cccc} 0 & -E^{x} & -E^{y} & -E^{z} \tag{42.22}\\ E^{x} & 0 & B^{z} & -B^{y} \\ E^{y} & -B^{z} & 0 & B^{x} \\ E^{z} & B^{y} & -B^{x} & 0 \end{array}\right) .(42.22)Fμν=(0ExEyEzEx0BzByEyBz0BxEzByBx0).
The components of F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ have dynamics of their own, encoded in Maxwell's equations of motion. Recall that Maxwell's equations are written in natural units as
(42.23) E = ρ , B = 0 , × E = B t , × B = J + E t . (42.23) E = ρ , B = 0 , × E = B t , × B = J + E t . {:(42.23){:[ vec(grad)* vec(E)=rho",", vec(grad)* vec(B)=0","],[ vec(grad)xx vec(E)=-(del( vec(B)))/(del t)",", vec(grad)xx vec(B)= vec(J)+(del( vec(E)))/(del t).]:}:}\begin{array}{cc} \vec{\nabla} \cdot \vec{E}=\rho, & \vec{\nabla} \cdot \vec{B}=0, \\ \vec{\nabla} \times \vec{E}=-\frac{\partial \vec{B}}{\partial t}, & \vec{\nabla} \times \vec{B}=\vec{J}+\frac{\partial \vec{E}}{\partial t} . \tag{42.23} \end{array}(42.23)E=ρ,B=0,×E=Bt,×B=J+Et.
We combine the charge density ρ ρ rho\rhoρ and the current density J J vec(J)\vec{J}J to create the current 4-vector J J J\boldsymbol{J}J with components J μ = ( ρ , J ) J μ = ( ρ , J ) J^(mu)=(rho, vec(J))J^{\mu}=(\rho, \vec{J})Jμ=(ρ,J). This vector represents the source of the electromagnetic fields (i.e. charges and currents) and also the bodies that are set in motion by the fields. 12 12 ^(12){ }^{12}12
Maxwell's equations can be re-expressed in terms of the components of the Faraday tensor. It can be checked that the 13 13 ^(13){ }^{13}13 Bianchi identity
(42.24) F α β , γ + F γ α , β + F β γ , α = 0 , (42.24) F α β , γ + F γ α , β + F β γ , α = 0 , {:(42.24)F_(alpha beta,gamma)+F_(gamma alpha,beta)+F_(beta gamma,alpha)=0",":}\begin{equation*} F_{\alpha \beta, \gamma}+F_{\gamma \alpha, \beta}+F_{\beta \gamma, \alpha}=0, \tag{42.24} \end{equation*}(42.24)Fαβ,γ+Fγα,β+Fβγ,α=0,
yields the Maxwell equations 14 B = 0 14 B = 0 ^(14) vec(grad)* vec(B)=0{ }^{14} \vec{\nabla} \cdot \vec{B}=014B=0 and 15 B / t = × E 15 B / t = × E ^(15)-del vec(B)//del t= vec(grad)xx vec(E){ }^{15}-\partial \vec{B} / \partial t=\vec{\nabla} \times \vec{E}15B/t=×E. The other two Maxwell equations (involving the components of the current J ) J ) J)J)J) can be recreated using the equation
(42.25) F μ ν , ν = J μ . (42.25) F μ ν , ν = J μ . {:(42.25)F^(mu nu)_(,nu)=J^(mu).:}\begin{equation*} F^{\mu \nu}{ }_{, \nu}=J^{\mu} . \tag{42.25} \end{equation*}(42.25)Fμν,ν=Jμ.
Example 42.2
Returning to the Lagrangian point of view, we claimed in Chapter 40 that the Lagrangian density for the electromagnetic matter fields themselves was L = L = L=\mathcal{L}=L= 1 4 F μ ν F μ ν 1 4 F μ ν F μ ν -(1)/(4)F_(mu nu)F^(mu nu)-\frac{1}{4} F_{\mu \nu} F^{\mu \nu}14FμνFμν. We can also include a generalized version of the coupling with Lagrangian L = q d x μ A μ L = q d x μ A μ L=qdx^(mu)A_(mu)L=q \mathrm{~d} x^{\mu} A_{\mu}L=q dxμAμ, which becomes a contribution to the Lagrangian density of L emf = J μ A μ L emf  = J μ A μ L_("emf ")=J^(mu)A_(mu)\mathcal{L}_{\text {emf }}=J^{\mu} A_{\mu}Lemf =JμAμ. We then have the action for the electromagnetic fields and their coupling to charges as given by
S em = d 4 x ( 1 4 F μ ν F μ ν + J μ A μ ) (42.26) = d 4 x [ 1 4 ( μ A ν ν A μ ) ( μ A ν ν A μ ) + J μ A μ ] S em = d 4 x 1 4 F μ ν F μ ν + J μ A μ (42.26) = d 4 x 1 4 μ A ν ν A μ μ A ν ν A μ + J μ A μ {:[S_(em)=intd^(4)x(-(1)/(4)F_(mu nu)F^(mu nu)+J^(mu)A_(mu))],[(42.26)=intd^(4)x[-(1)/(4)(del_(mu)A_(nu)-del_(nu)A_(mu))(del^(mu)A^(nu)-del^(nu)A^(mu))+J^(mu)A_(mu)]]:}\begin{align*} S_{\mathrm{em}} & =\int \mathrm{d}^{4} x\left(-\frac{1}{4} F_{\mu \nu} F^{\mu \nu}+J^{\mu} A_{\mu}\right) \\ & =\int \mathrm{d}^{4} x\left[-\frac{1}{4}\left(\partial_{\mu} A_{\nu}-\partial_{\nu} A_{\mu}\right)\left(\partial^{\mu} A^{\nu}-\partial^{\nu} A^{\mu}\right)+J^{\mu} A_{\mu}\right] \tag{42.26} \end{align*}Sem=d4x(14FμνFμν+JμAμ)(42.26)=d4x[14(μAννAμ)(μAννAμ)+JμAμ]
Feeding the Lagrangian density in the integrand through the Euler-Lagrange equations yields the Maxwell equations.
This is all very well, but in order to understand the Maxwell equations in the context of gravitation, we should examine the geometrical interpretation of electromagnetism. Indeed we shall shortly see how a geometrical viewpoint leads to the Maxwell equations.

Example 42.3

Before we get to a coordinate-free formulation, we can get an idea of how curvature changes electromagnetism. 16 16 ^(16){ }^{16}16 For simplicity we set the currents J μ J μ J^(mu)J^{\mu}Jμ to zero. In curved space with metric components g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν, we produce the Faraday tensor using the commas-go-to-semicolons rule, to give
(42.27) F μ ν = A ν ; μ A μ ; ν ( = 2 A [ ν ; μ ] ) (42.27) F μ ν = A ν ; μ A μ ; ν = 2 A [ ν ; μ ] {:(42.27)F_(mu nu)=A_(nu;mu)-A_(mu;nu)(=2A_([nu;mu])):}\begin{equation*} F_{\mu \nu}=A_{\nu ; \mu}-A_{\mu ; \nu}\left(=2 A_{[\nu ; \mu]}\right) \tag{42.27} \end{equation*}(42.27)Fμν=Aν;μAμ;ν(=2A[ν;μ])
Since we must be careful to use the metric to raise indices, we rewrite the flat-space Lagrangian density L em = 1 4 F μ ν F μ ν L em = 1 4 F μ ν F μ ν L_(em)=-(1)/(4)F^(mu nu)F_(mu nu)\mathcal{L}_{\mathrm{em}}=-\frac{1}{4} F^{\mu \nu} F_{\mu \nu}Lem=14FμνFμν, which becomes
(42.28) L em = 1 4 F α β F γ δ g α γ g β δ (42.28) L em = 1 4 F α β F γ δ g α γ g β δ {:(42.28)L_(em)=-(1)/(4)F_(alpha beta)F_(gamma delta)g^(alpha gamma)g^(beta delta):}\begin{equation*} \mathcal{L}_{\mathrm{em}}=-\frac{1}{4} F_{\alpha \beta} F_{\gamma \delta} g^{\alpha \gamma} g^{\beta \delta} \tag{42.28} \end{equation*}(42.28)Lem=14FαβFγδgαγgβδ
The equations of motion can then be found using the E-L equations. We have
(42.29) L m A α ; β = 0 (42.29) L m A α ; β = 0 {:(42.29)(delL_(m))/(delA_(alpha;beta))=0:}\begin{equation*} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial A_{\alpha ; \beta}}=0 \tag{42.29} \end{equation*}(42.29)LmAα;β=0
and
(42.30) L m A α ; β = ( A δ ; γ A γ ; δ ) g α γ g β δ (42.30) L m A α ; β = A δ ; γ A γ ; δ g α γ g β δ {:(42.30)(delL_(m))/(delA_(alpha;beta))=(A_(delta;gamma)-A_(gamma;delta))g^(alpha gamma)g^(beta delta):}\begin{equation*} \frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial A_{\alpha ; \beta}}=\left(A_{\delta ; \gamma}-A_{\gamma ; \delta}\right) g^{\alpha \gamma} g^{\beta \delta} \tag{42.30} \end{equation*}(42.30)LmAα;β=(Aδ;γAγ;δ)gαγgβδ
from which we find
( L m A α ; β ) ; β = ( A δ ; γ A γ ; δ ) ; β g α γ g β δ , (42.31) = F γ δ ; β g α γ g β δ = 0 , L m A α ; β ; β = A δ ; γ A γ ; δ ; β g α γ g β δ , (42.31) = F γ δ ; β g α γ g β δ = 0 , {:[((delL_(m))/(delA_(alpha;beta)))_(;beta)=(A_(delta;gamma)-A_(gamma;delta))_(;beta)g^(alpha gamma)g^(beta delta)","],[(42.31)=F_(gamma delta;beta)g^(alpha gamma)g^(beta delta)=0","]:}\begin{align*} \left(\frac{\partial \mathcal{L}_{\mathrm{m}}}{\partial A_{\alpha ; \beta}}\right)_{; \beta} & =\left(A_{\delta ; \gamma}-A_{\gamma ; \delta}\right)_{; \beta} g^{\alpha \gamma} g^{\beta \delta}, \\ & =F_{\gamma \delta ; \beta} g^{\alpha \gamma} g^{\beta \delta}=0, \tag{42.31} \end{align*}(LmAα;β);β=(Aδ;γAγ;δ);βgαγgβδ,(42.31)=Fγδ;βgαγgβδ=0,
where the last equality follows from the E-L equations. Since the g α γ g α γ g^(alpha gamma)g^{\alpha \gamma}gαγ simply multiplies all of the terms in the sum, it can be dropped. The equation is then equivalent to
(42.32) F γ δ ; β g β δ = 0 . (42.32) F γ δ ; β g β δ = 0 . {:(42.32)F_(gamma delta;beta)g^(beta delta)=0.:}\begin{equation*} F_{\gamma \delta ; \beta} g^{\beta \delta}=0 . \tag{42.32} \end{equation*}(42.32)Fγδ;βgβδ=0.
This expression gives two of the Maxwell equations in curved space, in the absence of sources. 17 17 ^(17){ }^{17}17
16 16 ^(16){ }^{16}16 The electromagnetic field itself generates curvature via its energymomentum tensor, as discussed below.
18 18 ^(18){ }^{18}18 The Poynting vector, named after J. H. Poynting (1852-1914) who invented it, is defined in classical electromagnetism as the vector S = E × H S = E × H vec(S)= vec(E)xx vec(H)\vec{S}=\vec{E} \times \vec{H}S=E×H (equivalently S = E × B / μ 0 S = E × B / μ 0 vec(S)= vec(E)xx vec(B)//mu_(0)\vec{S}=\vec{E} \times \vec{B} / \mu_{0}S=E×B/μ0 ). Its magnitude gives the energy flux (the energy flow per unit time, per unit area) and its direction indicates the direction of the energy flow. Our expression for T 0 j T 0 j T^(0j)T^{0 j}T0j gives the j j jjj th component of S S vec(S)\vec{S}S, with the constant μ 0 μ 0 mu_(0)\mu_{0}μ0 disappearing in our units.
\curvearrowright The concept of gauges is
important for describing sev-
eral topics in the coming chap-
ters, including gravitational
waves. Gauges are explained
in more detail in Chapter 44 .
19 19 ^(19){ }^{19}19 It might be helpful to think of the choice of gauge as a choice of language, where we are able to communicate the same message no matter which language we choose.
20 20 ^(20){ }^{20}20 The notation here makes use of
2 2 t 2 + 2 x 2 + 2 y 2 + 2 z 2 2 2 t 2 + 2 x 2 + 2 y 2 + 2 z 2 del^(2)-=-(del^(2))/(delt^(2))+(del^(2))/(delx^(2))+(del^(2))/(dely^(2))+(del^(2))/(delz^(2))\partial^{2} \equiv-\frac{\partial^{2}}{\partial t^{2}}+\frac{\partial^{2}}{\partial x^{2}}+\frac{\partial^{2}}{\partial y^{2}}+\frac{\partial^{2}}{\partial z^{2}}22t2+2x2+2y2+2z2
(42.39) = 2 t 2 + 2 (42.39) = 2 t 2 + 2 {:(42.39)=-(del^(2))/(delt^(2))+ vec(grad)^(2):}\begin{equation*} =-\frac{\partial^{2}}{\partial t^{2}}+\vec{\nabla}^{2} \tag{42.39} \end{equation*}(42.39)=2t2+2
Central to the role of fields in general relativity is the energy-momentum tensor T ( T ( T(\boldsymbol{T}(T(, ) . W e m e t a m e t h o d i n C h a p t e r 40 ) . W e m e t a m e t h o d i n C h a p t e r 40 ).WemetamethodinChapter 40) . We met a method in Chapter 40).WemetamethodinChapter40 that allows us to derive this object from the Lagrangian of the electromagnetic fields. In Minkowski spacetime, the ( 2,0 ) energy-momentum tensor T T T\boldsymbol{T}T for the electromagnetic field has components
(42.34) T μ ν = F μ α F α ν 1 4 η μ ν F α β F α β . (42.34) T μ ν = F μ α F α ν 1 4 η μ ν F α β F α β . {:(42.34)T^(mu nu)=F^(mu alpha)F_(alpha)^(nu)-(1)/(4)eta^(mu nu)F_(alpha beta)F^(alpha beta).:}\begin{equation*} T^{\mu \nu}=F^{\mu \alpha} F_{\alpha}^{\nu}-\frac{1}{4} \eta^{\mu \nu} F_{\alpha \beta} F^{\alpha \beta} . \tag{42.34} \end{equation*}(42.34)Tμν=FμαFαν14ημνFαβFαβ.
Example 42.4
The components of this tensor can be recast in terms of the E E EEE - and B B BBB-fields. We find that the energy density is
(42.35) T 00 = 1 2 ( E 2 + B 2 ) (42.35) T 00 = 1 2 E 2 + B 2 {:(42.35)T^(00)=(1)/(2)( vec(E)^(2)+ vec(B)^(2)):}\begin{equation*} T^{00}=\frac{1}{2}\left(\vec{E}^{2}+\vec{B}^{2}\right) \tag{42.35} \end{equation*}(42.35)T00=12(E2+B2)
while the momentum density has components
(42.36) T 0 j = ( E × B ) j (42.36) T 0 j = ( E × B ) j {:(42.36)T^(0j)=( vec(E)xx vec(B))^(j):}\begin{equation*} T^{0 j}=(\vec{E} \times \vec{B})^{j} \tag{42.36} \end{equation*}(42.36)T0j=(E×B)j
These latter components are often given the name the Poynting vector. 18 18 ^(18){ }^{18}18 Finally, the all-spatial components of T T T\boldsymbol{T}T are
(42.37) T j k = [ ( E j E k + B j B k ) 1 2 ( E 2 + B 2 ) δ j k ] (42.37) T j k = E j E k + B j B k 1 2 E 2 + B 2 δ j k {:(42.37)T^(jk)=-[(E^(j)E^(k)+B^(j)B^(k))-(1)/(2)( vec(E)^(2)+ vec(B)^(2))delta^(jk)]:}\begin{equation*} T^{j k}=-\left[\left(E^{j} E^{k}+B^{j} B^{k}\right)-\frac{1}{2}\left(\vec{E}^{2}+\vec{B}^{2}\right) \delta^{j k}\right] \tag{42.37} \end{equation*}(42.37)Tjk=[(EjEk+BjBk)12(E2+B2)δjk]
In curved spacetime, we can write the ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2) version of the energymomentum tensor T T T\boldsymbol{T}T in terms of
(42.38) T μ ν = F μ α F ν β g α β 1 4 g μ ν F γ δ F λ ρ g γ λ g δ ρ (42.38) T μ ν = F μ α F ν β g α β 1 4 g μ ν F γ δ F λ ρ g γ λ g δ ρ {:(42.38)T_(mu nu)=F_(mu alpha)F_(nu beta)g^(alpha beta)-(1)/(4)g_(mu nu)F_(gamma delta)F_(lambda rho)g^(gamma lambda)g^(delta rho):}\begin{equation*} T_{\mu \nu}=F_{\mu \alpha} F_{\nu \beta} g^{\alpha \beta}-\frac{1}{4} g_{\mu \nu} F_{\gamma \delta} F_{\lambda \rho} g^{\gamma \lambda} g^{\delta \rho} \tag{42.38} \end{equation*}(42.38)Tμν=FμαFνβgαβ14gμνFγδFλρgγλgδρ
It is the tensor T ( T ( T(\boldsymbol{T}(T(, ) w i t h t h e s e c o m p o n e n t s t h a t c o u p l e s t o t h e c u r v a ) w i t h t h e s e c o m p o n e n t s t h a t c o u p l e s t o t h e c u r v a )withthesecomponentsthatcouplestothecurva-) with these components that couples to the curva-)withthesecomponentsthatcouplestothecurva ture of spacetime.

42.3 Gauge freedom

The electromagnetic 1-form field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x) is an important one. In several forthcoming chapters, we shall have cause to invoke the principle of gauge freedom, by using eqn 42.20 . The principle says that no physics changes if we make the change A μ ( x ) A μ ( x ) χ , μ ( x ) A μ ( x ) A μ ( x ) χ , μ ( x ) A_(mu)(x)rarrA_(mu)(x)-chi_(,mu)(x)A_{\mu}(x) \rightarrow A_{\mu}(x)-\chi_{, \mu}(x)Aμ(x)Aμ(x)χ,μ(x). That is to say, the gauge transformation cannot change the result of any measurement. The choice of the smooth function χ ( x ) χ ( x ) chi(x)\chi(x)χ(x) is known as a choice of gauge. 19 19 ^(19){ }^{19}19 A sensible choice of the function χ ( x ) χ ( x ) chi(x)\chi(x)χ(x) can simplify a problem.

Example 42.5

Consider how the expression ν F μ ν = J μ ν F μ ν = J μ del_(nu)F^(mu nu)=J^(mu)\partial_{\nu} F^{\mu \nu}=J^{\mu}νFμν=Jμ can be rewritten in terms of the components A μ A μ A^(mu)A^{\mu}Aμ as 20 20 ^(20)^{20}20
(42.40) μ ( ν A ν ) 2 A μ = J μ (42.40) μ ν A ν 2 A μ = J μ {:(42.40)del^(mu)(del_(nu)A^(nu))-del^(2)A^(mu)=J^(mu):}\begin{equation*} \partial^{\mu}\left(\partial_{\nu} A^{\nu}\right)-\partial^{2} A^{\mu}=J^{\mu} \tag{42.40} \end{equation*}(42.40)μ(νAν)2Aμ=Jμ
It would simplify our use of this equation of motion if the first term on the left-hand side could be made to go away through a sensible choice of χ χ chi\chiχ. To do this we write A μ A μ = A μ μ χ A μ A μ = A μ μ χ A_(mu)rarrA_(mu)^(')=A_(mu)-del_(mu)chiA_{\mu} \rightarrow A_{\mu}^{\prime}=A_{\mu}-\partial_{\mu} \chiAμAμ=Aμμχ. What we want is then
(42.41) μ A μ = μ A μ μ μ χ = 0 (42.41) μ A μ = μ A μ μ μ χ = 0 {:(42.41)del^(mu)A_(mu)^(')=del^(mu)A_(mu)-del^(mu)del_(mu)chi=0:}\begin{equation*} \partial^{\mu} A_{\mu}^{\prime}=\partial^{\mu} A_{\mu}-\partial^{\mu} \partial_{\mu} \chi=0 \tag{42.41} \end{equation*}(42.41)μAμ=μAμμμχ=0
which we can satisfy by setting 2 χ = μ A μ 2 χ = μ A μ del^(2)chi=del^(mu)A_(mu)\partial^{2} \chi=\partial^{\mu} A_{\mu}2χ=μAμ. The equation of motion for the electromagnetic field then becomes
(42.42) 2 A μ = J μ (42.42) 2 A μ = J μ {:(42.42)del^(2)A^(mu)=-J^(mu):}\begin{equation*} \partial^{2} A^{\mu}=-J^{\mu} \tag{42.42} \end{equation*}(42.42)2Aμ=Jμ
Note that in the absence of sources, this tells us 2 A μ = 0 2 A μ = 0 del^(2)A^(mu)=0\partial^{2} A^{\mu}=02Aμ=0, which is a wave equation 21 21 ^(21){ }^{21}21 for the components A ν A ν A^(nu)A^{\nu}Aν, which is to say
(42.43) 2 A μ t 2 + 2 A μ = 0 (42.43) 2 A μ t 2 + 2 A μ = 0 {:(42.43)-(del^(2)A^(mu))/(delt^(2))+ vec(grad)^(2)A^(mu)=0:}\begin{equation*} -\frac{\partial^{2} A^{\mu}}{\partial t^{2}}+\vec{\nabla}^{2} A^{\mu}=0 \tag{42.43} \end{equation*}(42.43)2Aμt2+2Aμ=0
The electromagnetic field propagates through space at the speed of light.
The choice of χ χ chi\chiχ such that μ A μ = 0 μ A μ = 0 del_(mu)A^(mu)=0\partial_{\mu} A^{\mu}=0μAμ=0 is known as 22 22 ^(22){ }^{22}22 the Lorenz gauge. In fact, the Lorenz gauge does not exhaust the gauge freedom.
Example 42.6
This is because we can make a further shift A μ A μ = A μ μ ξ A μ A μ = A μ μ ξ A_(mu)^(')rarrA_(mu)^('')=A_(mu)^(')-del_(mu)xiA_{\mu}^{\prime} \rightarrow A_{\mu}^{\prime \prime}=A_{\mu}^{\prime}-\partial_{\mu} \xiAμAμ=Aμμξ as long as 2 ξ = 0 2 ξ = 0 del^(2)xi=0\partial^{2} \xi=02ξ=0 (so that both A μ A μ A_(mu)^(')A_{\mu}^{\prime}Aμ and A μ A μ A_(mu)^('')A_{\mu}^{\prime \prime}Aμ satisfy the Lorenz condition). To make A μ A μ A^(''mu)A^{\prime \prime \mu}Aμ unique, we shall choose
(42.44) 0 ξ = A 0 (42.44) 0 ξ = A 0 {:(42.44)del_(0)xi=A_(0)^('):}\begin{equation*} \partial_{0} \xi=A_{0}^{\prime} \tag{42.44} \end{equation*}(42.44)0ξ=A0
which implies A 0 = 0 A 0 = 0 A_(0)^('')=0A_{0}^{\prime \prime}=0A0=0. With this further choice, the Lorenz condition then reduces to A = 0 A = 0 vec(grad)* vec(A^(''))=0\vec{\nabla} \cdot \overrightarrow{A^{\prime \prime}}=0A=0. This choice is known as Coulomb gauge. 23 23 ^(23){ }^{23}23
Similar choices of gauge will be important when we come to examine gravitational waves in Chapter 46. For now, it's useful to note that we can write down the solutions to the equation of motion for the electromagnetic field: 2 A μ = J μ 2 A μ = J μ del^(2)A^(mu)=-J^(mu)\partial^{2} A^{\mu}=-J^{\mu}2Aμ=Jμ.

Example 42.7

Consider the electromagnetic field at position x x vec(x)\vec{x}x that results from a source at x x vec(x)^(')\vec{x}^{\prime}x, as shown in Fig. 42.1. In the static case, we have
(42.45) 2 A μ = J μ ( static ) , (42.45) 2 A μ = J μ (  static  ) , {:(42.45) vec(grad)^(2)A^(mu)=-J^(mu)quad(" static ")",":}\begin{equation*} \vec{\nabla}^{2} A^{\mu}=-J^{\mu} \quad(\text { static }), \tag{42.45} \end{equation*}(42.45)2Aμ=Jμ( static ),
and so, for A μ = ( A 0 , A ) A μ = A 0 , A A^(mu)=(A^(0),( vec(A)))A^{\mu}=\left(A^{0}, \vec{A}\right)Aμ=(A0,A) and J μ = ( ρ , J ) J μ = ( ρ , J ) J^(mu)=(rho, vec(J))J^{\mu}=(\rho, \vec{J})Jμ=(ρ,J) we write
(42.46) 2 A 0 = ρ , 2 A = J (static) (42.46) 2 A 0 = ρ , 2 A = J  (static)  {:(42.46) vec(grad)^(2)A^(0)=-rho","quad vec(grad)^(2) vec(A)=- vec(J)quad" (static) ":}\begin{equation*} \vec{\nabla}^{2} A^{0}=-\rho, \quad \vec{\nabla}^{2} \vec{A}=-\vec{J} \quad \text { (static) } \tag{42.46} \end{equation*}(42.46)2A0=ρ,2A=J (static) 
We arrange the currents and charges in a three-volume V V V^(')\mathcal{V}^{\prime}V as shown in Fig. 42.1. These latter equations then have solutions 24 24 ^(24){ }^{24}24
(42.47) A 0 ( x ) = 1 4 π d V ρ ( x ) | x x | , A ( x ) = 1 4 π d V J ( x ) | x x | (static). (42.47) A 0 ( x ) = 1 4 π d V ρ x x x , A ( x ) = 1 4 π d V J x x x  (static).  {:(42.47)A^(0)( vec(x))=(1)/(4pi)intdV^(')(rho( vec(x)^(')))/(|( vec(x))- vec(x)^(')|)","quad vec(A)( vec(x))=(1)/(4pi)intdV^(')(( vec(J))( vec(x)^(')))/(|( vec(x))- vec(x)^(')|)quad" (static). ":}\begin{equation*} A^{0}(\vec{x})=\frac{1}{4 \pi} \int \mathrm{~d} \mathcal{V}^{\prime} \frac{\rho\left(\vec{x}^{\prime}\right)}{\left|\vec{x}-\vec{x}^{\prime}\right|}, \quad \vec{A}(\vec{x})=\frac{1}{4 \pi} \int \mathrm{~d} \mathcal{V}^{\prime} \frac{\vec{J}\left(\vec{x}^{\prime}\right)}{\left|\vec{x}-\vec{x}^{\prime}\right|} \quad \text { (static). } \tag{42.47} \end{equation*}(42.47)A0(x)=14π dVρ(x)|xx|,A(x)=14π dVJ(x)|xx| (static). 
In words: The charges and currents at positions x x vec(x)^(')\vec{x}^{\prime}x are added up to give the fields at x x vec(x)\vec{x}x. Since the electromagnetic field propagates at the speed of light, adding time dependence requires us to evaluate the charge and current distribution at the retarded time
(42.48) t r = t | x x | c (42.48) t r = t x x c {:(42.48)t_(r)=t-(|( vec(x))- vec(x)^(')|)/(c):}\begin{equation*} t_{\mathrm{r}}=t-\frac{\left|\vec{x}-\vec{x}^{\prime}\right|}{c} \tag{42.48} \end{equation*}(42.48)tr=t|xx|c
where we temporarily reinstate the speed of light c c ccc. This says that the field experienced at x μ = ( t , x ) x μ = ( t , x ) x^(mu)=(t, vec(x))x^{\mu}=(t, \vec{x})xμ=(t,x) is determined by adding up the charges and currents at x x vec(x)^(')\vec{x}^{\prime}x at a time t r t r t_(r)t_{\mathrm{r}}tr, since it takes | x x | / c x x / c |( vec(x))- vec(x)^(')|//c\left|\vec{x}-\vec{x}^{\prime}\right| / c|xx|/c seconds for the field to propagate to x x vec(x)\vec{x}x. The solutions are then given by the retarded potentials
(42.49) A 0 ( t , x ) = 1 4 π d 3 V ρ ( t r , x ) | x x | , A ( t , x ) = 1 4 π d V J ( t r , x ) | x x | . (42.49) A 0 ( t , x ) = 1 4 π d 3 V ρ t r , x | x x | , A ( t , x ) = 1 4 π d V J t r , x x x . {:(42.49)A^(0)(t"," vec(x))=(1)/(4pi)intd^(3)V^(')(rho(t_(r), vec(x)^(')))/(|( vec(x))-( vec(x))|)","quad vec(A)(t"," vec(x))=(1)/(4pi)intdV^(')(( vec(J))(t_(r), vec(x)^(')))/(|( vec(x))- vec(x)^(')|).:}\begin{equation*} A^{0}(t, \vec{x})=\frac{1}{4 \pi} \int \mathrm{~d}^{3} \mathcal{V}^{\prime} \frac{\rho\left(t_{r}, \vec{x}^{\prime}\right)}{|\vec{x}-\vec{x}|}, \quad \vec{A}(t, \vec{x})=\frac{1}{4 \pi} \int \mathrm{~d} \mathcal{V}^{\prime} \frac{\vec{J}\left(t_{r}, \vec{x}^{\prime}\right)}{\left|\vec{x}-\vec{x}^{\prime}\right|} . \tag{42.49} \end{equation*}(42.49)A0(t,x)=14π d3Vρ(tr,x)|xx|,A(t,x)=14π dVJ(tr,x)|xx|.
21 21 ^(21){ }^{21}21 A wave equation has the form 2 ϕ = 2 ϕ = del^(2)phi=\partial^{2} \phi=2ϕ= 2 ϕ t 2 + 2 ϕ = 0 2 ϕ t 2 + 2 ϕ = 0 -(del^(2)phi)/(delt^(2))+ vec(grad)^(2)phi=0-\frac{\partial^{2} \phi}{\partial t^{2}}+\vec{\nabla}^{2} \phi=02ϕt2+2ϕ=0. Its solutions are plane waves of the form ϕ = ϕ = phi=\phi=ϕ= e are plane i ( ω t k x ) . . e are plane  i ( ω t k x ) . . e^("are plane ")i(omega t- vec(k)* vec(x))..\mathrm{e}^{\text {are plane }} \mathrm{i}(\omega t-\vec{k} \cdot \vec{x}) . ~ . ~eare plane i(ωtkx). . 
22 22 ^(22){ }^{22}22 Actually it's more usually (and incorrectly) known as Lorentz gauge due to its misattribution to Hendrik Lorentz (1853-1928) rather than to the less famous Ludvig Lorenz (1829-1891) who used it first. See J. D. Jackson and L. B. Okun, Rev. Mod. Phys. 73, 663 (2001) for details of the history. In gravitation, the analogous choice of gauge is sometimes known as the harmonic gauge or de Donder gauge [the latter named after Théophile de Donder (1872-1957)].
23 23 ^(23){ }^{23}23 Charles-Augustin de Coulomb (1736-1806).
Fig. 42.1 The geometry for Example 42.7. The field at x x vec(x)\vec{x}x results from a distribution of currents and charges with elements at positions x x vec(x)^(')\vec{x}^{\prime}x in a threevolume V V V^(')\mathcal{V}^{\prime}V.
24 24 ^(24){ }^{24}24 See Exercise 42.7, where useful identities are given for filling in the algebra in this example.
25 25 ^(25){ }^{25}25 Our coordinate-free expressions can be straightforwardly written in coordinates in flat or curved spacetime. We shall continue to assume flat spacetime in our discussion.
(42.50) A μ ( t , x ) = 1 4 π d V J μ ( t r , x ) | x x | (42.50) A μ ( t , x ) = 1 4 π d V J μ t r , x x x {:(42.50)A^(mu)(t"," vec(x))=(1)/(4pi)intdV^(')(J^(mu)(t_(r), vec(x)^(')))/(|( vec(x))- vec(x)^(')|):}\begin{equation*} A^{\mu}(t, \vec{x})=\frac{1}{4 \pi} \int \mathrm{~d} \mathcal{V}^{\prime} \frac{J^{\mu}\left(t_{\mathrm{r}}, \vec{x}^{\prime}\right)}{\left|\vec{x}-\vec{x}^{\prime}\right|} \tag{42.50} \end{equation*}(42.50)Aμ(t,x)=14π dVJμ(tr,x)|xx|
We shall later use analogous potentials to describe the gravitational metric field generated by a distribution of masses and energies.

42.4 Geometrical electromagnetism

We now, finally, re-express the equations of electromagnetism in coordinate-free language. 25 25 ^(25){ }^{25}25 We start with the electromagnetic 1 -form gauge field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x). We take its exterior derivative and find that we obtain the 2 -form F F F\boldsymbol{F}F, or
(42.51) F ~ = d A ~ (42.51) F ~ = d A ~ {:(42.51) tilde(F)=d tilde(A):}\begin{equation*} \tilde{F}=d \tilde{\boldsymbol{A}} \tag{42.51} \end{equation*}(42.51)F~=dA~

Example 42.8

We pause to prove the assertion above. We have A ~ = A ν d x ν A ~ = A ν d x ν tilde(A)=A_(nu)dx^(nu)\tilde{\boldsymbol{A}}=A_{\nu} \boldsymbol{d} x^{\nu}A~=Aνdxν. The exterior derivative gives a 2 -form
d A ~ = A ν x μ d x μ d x ν = 1 2 A ν x μ d x μ d x μ 1 2 A ν x μ d x ν d x μ (42.52) = 1 2 ( A ν x μ A μ x ν ) d x μ d x ν d A ~ = A ν x μ d x μ d x ν = 1 2 A ν x μ d x μ d x μ 1 2 A ν x μ d x ν d x μ (42.52) = 1 2 A ν x μ A μ x ν d x μ d x ν {:[d tilde(A)=(delA_(nu))/(delx^(mu))dx^(mu)^^dx^(nu)],[=(1)/(2)(delA_(nu))/(delx^(mu))dx^(mu)^^dx^(mu)-(1)/(2)(delA_(nu))/(delx^(mu))dx^(nu)^^dx^(mu)],[(42.52)=(1)/(2)((delA_(nu))/(delx^(mu))-(delA_(mu))/(delx^(nu)))dx^(mu)^^dx^(nu)]:}\begin{align*} \boldsymbol{d} \tilde{\boldsymbol{A}} & =\frac{\partial A_{\nu}}{\partial x^{\mu}} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu} \\ & =\frac{1}{2} \frac{\partial A_{\nu}}{\partial x^{\mu}} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\mu}-\frac{1}{2} \frac{\partial A_{\nu}}{\partial x^{\mu}} \boldsymbol{d} x^{\nu} \wedge \boldsymbol{d} x^{\mu} \\ & =\frac{1}{2}\left(\frac{\partial A_{\nu}}{\partial x^{\mu}}-\frac{\partial A_{\mu}}{\partial x^{\nu}}\right) \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu} \tag{42.52} \end{align*}dA~=Aνxμdxμdxν=12Aνxμdxμdxμ12Aνxμdxνdxμ(42.52)=12(AνxμAμxν)dxμdxν
We see that F ~ = d A ~ F ~ = d A ~ tilde(F)=d tilde(A)\tilde{\boldsymbol{F}}=\boldsymbol{d} \tilde{\boldsymbol{A}}F~=dA~, where F ~ = 1 2 F μ ν d x μ d x ν F ~ = 1 2 F μ ν d x μ d x ν tilde(F)=(1)/(2)F_(mu nu)dx^(mu)^^dx^(nu)\tilde{\boldsymbol{F}}=\frac{1}{2} F_{\mu \nu} \boldsymbol{d} x^{\mu} \wedge \boldsymbol{d} x^{\nu}F~=12Fμνdxμdxν and
(42.53) F μ ν = ( A ν x μ A μ x ν ) (42.53) F μ ν = A ν x μ A μ x ν {:(42.53)F_(mu nu)=((delA_(nu))/(delx^(mu))-(delA_(mu))/(delx^(nu))):}\begin{equation*} F_{\mu \nu}=\left(\frac{\partial A_{\nu}}{\partial x^{\mu}}-\frac{\partial A_{\mu}}{\partial x^{\nu}}\right) \tag{42.53} \end{equation*}(42.53)Fμν=(AνxμAμxν)
come choose to write the indices for the position. E E EEE and B B vec(B)\vec{B}B 3-vectors in the f 4 -vions These are not components 4-vectors, so there is no implied dif
Fig. 42.2 A B B BBB-field in the geometrical language circulates in the tubes of a 2 -form. The field F = B x d y d z F = B x d y d z F=B^(x)dy^^dz\boldsymbol{F}=B^{x} \boldsymbol{d} y \wedge \boldsymbol{d} zF=Bxdydz is represented here, with the field component B x B x B^(x)B^{x}Bx circulating in the tube formed by d y d y dy\boldsymbol{d} ydy and d z d z dz\boldsymbol{d} zdz.
as we had identified originally.
Recall that the expression F ~ = d A ~ F ~ = d A ~ tilde(F)=d tilde(A)\tilde{\boldsymbol{F}}=\boldsymbol{d} \tilde{\boldsymbol{A}}F~=dA~ means that the 2 -form F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ is exact and, therefore, closed, with the property that d F ~ = d d A ~ = 0 d F ~ = d d A ~ = 0 d tilde(F)=dd tilde(A)=0\boldsymbol{d} \tilde{\boldsymbol{F}}=\boldsymbol{d} \boldsymbol{d} \tilde{\boldsymbol{A}}=0dF~=ddA~=0.
Expressing electromagnetism in terms of the 2 -form F F F\boldsymbol{F}F allows us to illustrate the theory in a slightly different manner to the usual field lines of E E vec(E)\vec{E}E and B B vec(B)\vec{B}B, by using the idea from Chapter 32 that a 2 -form looks like a set of intersecting planes. Using the wedge product, we can identify the components of the Faraday 2 -form with the components of the electric and magnetic 3 -vectors 26 26 ^(26){ }^{26}26 thus
F ~ = E x d x d t + E y d y d t + E z d z d t (42.54) + B x d y d z + B y d z d x + B z d x d y F ~ = E x d x d t + E y d y d t + E z d z d t (42.54) + B x d y d z + B y d z d x + B z d x d y {:[ tilde(F)=E^(x)dx^^dt+E^(y)dy^^dt+E^(z)dz^^dt],[(42.54)+B^(x)dy^^dz+B^(y)dz^^dx+B^(z)dx^^dy]:}\begin{align*} \tilde{\boldsymbol{F}}= & E^{x} \boldsymbol{d} x \wedge \boldsymbol{d} t+E^{y} \boldsymbol{d} y \wedge \boldsymbol{d} t+E^{z} \boldsymbol{d} z \wedge \boldsymbol{d} t \\ & +B^{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+B^{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+B^{z} \boldsymbol{d} x \wedge \boldsymbol{d} y \tag{42.54} \end{align*}F~=Exdxdt+Eydydt+Ezdzdt(42.54)+Bxdydz+Bydzdx+Bzdxdy
We can therefore picture the components E i E i E^(i)E^{i}Ei and B i B i B^(i)B^{i}Bi circulating in the tubes formed from the intersecting basis 1-forms d x μ d x μ dx^(mu)\boldsymbol{d} x^{\mu}dxμ. An example is shown in Fig. 42.2.
Equation 42.54 allows us access to the components of the equations of motion via eqn 42.12. Specifically, we start with the momentum 1-form
p ~ = p μ d x μ p ~ = p μ d x μ tilde(p)=p_(mu)dx^(mu)\tilde{\boldsymbol{p}}=p_{\mu} \boldsymbol{d} x^{\mu}p~=pμdxμ and the velocity of the particle u u u\boldsymbol{u}u, which is tangent to its world line (parametrized by the proper time τ τ tau\tauτ ). The equation of motion is then d p ~ / d τ = q F ~ ( , u ) d p ~ / d τ = q F ~ ( , u ) d tilde(p)//dtau=q tilde(F)(,u)\mathrm{d} \tilde{\boldsymbol{p}} / \mathrm{d} \tau=q \tilde{\boldsymbol{F}}(, \boldsymbol{u})dp~/dτ=qF~(,u). That is, we fill one of the slots of the 2 -form F F F\boldsymbol{F}F with the velocity u u u\boldsymbol{u}u.
Example 42.9
A particle moves along the z z zzz-axis. In a region where there is a constant magnetic field in the x x xxx-direction, we have a Faraday tensor
(42.55) F ~ ( , ) = B x d y d z ( , ) (42.55) F ~ ( , ) = B x d y d z ( , ) {:(42.55) tilde(F)(",")=B^(x)dy^^dz(","):}\begin{equation*} \tilde{\boldsymbol{F}}(,)=B^{x} \boldsymbol{d} y \wedge \boldsymbol{d} z(,) \tag{42.55} \end{equation*}(42.55)F~(,)=Bxdydz(,)
We find the equation of motion
d p ~ d τ = q F ~ ( , u ) = q B x d y d z ( , u ) = q B x [ d y ( ) d z ( u ) d z ( ) d y ( u ) ] (42.56) = q B x ( d y u z d z u y ) d p ~ d τ = q F ~ ( , u ) = q B x d y d z ( , u ) = q B x [ d y ( ) d z ( u ) d z ( ) d y ( u ) ] (42.56) = q B x d y u z d z u y {:[(d( tilde(p)))/((d)tau)=q tilde(F)(","u)=qB^(x)dy^^dz(","u)],[=qB^(x)[dy()ox dz(u)-dz()ox dy(u)]],[(42.56)=qB^(x)(dyu^(z)-dzu^(y))]:}\begin{align*} \frac{\mathrm{d} \tilde{\boldsymbol{p}}}{\mathrm{~d} \tau}=q \tilde{\boldsymbol{F}}(, \boldsymbol{u}) & =q B^{x} \boldsymbol{d} y \wedge \boldsymbol{d} z(, \boldsymbol{u}) \\ & =q B^{x}[\boldsymbol{d} y() \otimes \boldsymbol{d} z(\boldsymbol{u})-\boldsymbol{d} z() \otimes \boldsymbol{d} y(\boldsymbol{u})] \\ & =q B^{x}\left(\boldsymbol{d} y u^{z}-\boldsymbol{d} z u^{y}\right) \tag{42.56} \end{align*}dp~ dτ=qF~(,u)=qBxdydz(,u)=qBx[dy()dz(u)dz()dy(u)](42.56)=qBx(dyuzdzuy)
We extract equations in component form
d p x d τ = 0 d p y d τ = q B x u z = q B x d z d τ (42.57) d p z d τ = q B x u y = q B x d y d τ d p x d τ = 0 d p y d τ = q B x u z = q B x d z d τ (42.57) d p z d τ = q B x u y = q B x d y d τ {:[(dp_(x))/((d)tau)=0],[((d)p_(y))/((d)tau)=qB^(x)u^(z)=qB^(x)((d)z)/((d)tau)],[(42.57)((d)p_(z))/((d)tau)=-qB^(x)u^(y)=-qB^(x)((d)y)/((d)tau)]:}\begin{align*} & \frac{\mathrm{d} p_{x}}{\mathrm{~d} \tau}=0 \\ & \frac{\mathrm{~d} p_{y}}{\mathrm{~d} \tau}=q B^{x} u^{z}=q B^{x} \frac{\mathrm{~d} z}{\mathrm{~d} \tau} \\ & \frac{\mathrm{~d} p_{z}}{\mathrm{~d} \tau}=-q B^{x} u^{y}=-q B^{x} \frac{\mathrm{~d} y}{\mathrm{~d} \tau} \tag{42.57} \end{align*}dpx dτ=0 dpy dτ=qBxuz=qBx dz dτ(42.57) dpz dτ=qBxuy=qBx dy dτ
The solutions to these equations are the helical trajectories typical for a charge in a magnetic field, as shown in Fig. 42.3.
The geometrical version of the Faraday tensor is also helpful in understanding the gauge transformation in eqn 42.20 which can be rewritten as
(42.58) A μ d x μ A μ d x μ χ x μ d x μ (42.58) A μ d x μ A μ d x μ χ x μ d x μ {:(42.58)A_(mu)dx^(mu)rarrA_(mu)dx^(mu)-(del chi)/(delx^(mu))dx^(mu):}\begin{equation*} A_{\mu} \boldsymbol{d} x^{\mu} \rightarrow A_{\mu} \boldsymbol{d} x^{\mu}-\frac{\partial \chi}{\partial x^{\mu}} \boldsymbol{d} x^{\mu} \tag{42.58} \end{equation*}(42.58)AμdxμAμdxμχxμdxμ
or
(42.59) A ~ A ~ d χ (42.59) A ~ A ~ d χ {:(42.59) tilde(A)rarr tilde(A)-d_(chi):}\begin{equation*} \tilde{A} \rightarrow \tilde{A}-d_{\chi} \tag{42.59} \end{equation*}(42.59)A~A~dχ
It is clear from this geometrical expression that a gauge transformation cannot have any effect on the Faraday 2 -form F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~, since
(42.60) F ~ = d A ~ d A ~ d d χ (42.60) F ~ = d A ~ d A ~ d d χ {:(42.60) tilde(F)=d tilde(A)rarr d tilde(A)-dd chi:}\begin{equation*} \tilde{F}=d \tilde{A} \rightarrow d \tilde{A}-d d \chi \tag{42.60} \end{equation*}(42.60)F~=dA~dA~ddχ
and, since d d = 0 d d = 0 dd=0\boldsymbol{d} \boldsymbol{d}=0dd=0, we see that F ~ F ~ F ~ F ~ tilde(F)rarr tilde(F)\tilde{\boldsymbol{F}} \rightarrow \tilde{\boldsymbol{F}}F~F~ under a gauge transformation. Since a gauge transformation has no effect on the Faraday 2-form, it cannot affect the E E vec(E)\vec{E}E - and B B vec(B)\vec{B}B-fields that make up its components.
We now turn to the Maxwell equations. The first two Maxwell equations may be written in terms of the Faraday 2-form as
(42.61) d F ~ = 0 (42.61) d F ~ = 0 {:(42.61)d tilde(F)=0:}\begin{equation*} \boldsymbol{d} \tilde{\boldsymbol{F}}=0 \tag{42.61} \end{equation*}(42.61)dF~=0
which we know to be true by virtue of the definition F ~ = d A ~ F ~ = d A ~ tilde(F)=d tilde(A)\tilde{\boldsymbol{F}}=\boldsymbol{d} \tilde{\boldsymbol{A}}F~=dA~ and the fact that d d = 0 d d = 0 dd=0\boldsymbol{d} \boldsymbol{d}=0dd=0. To obtain the Maxwell equations in a familiar guise, we explicitly take an exterior derivative
(42.62) d F ~ = 1 2 F α β x σ d x σ d x α d x β = 0 . (42.62) d F ~ = 1 2 F α β x σ d x σ d x α d x β = 0 . {:(42.62)d tilde(F)=(1)/(2)(delF_(alpha beta))/(delx^(sigma))dx^(sigma)^^dx^(alpha)^^dx^(beta)=0.:}\begin{equation*} \boldsymbol{d} \tilde{\boldsymbol{F}}=\frac{1}{2} \frac{\partial F_{\alpha \beta}}{\partial x^{\sigma}} \boldsymbol{d} x^{\sigma} \wedge \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta}=0 . \tag{42.62} \end{equation*}(42.62)dF~=12Fαβxσdxσdxαdxβ=0.
Fig. 42.3 Helical motion of a charged particle in a B B BBB-field.
Example 42.10
Doing this, we see
2 d F ~ = ( B x x + B y y + B z z ) d x d y d z + ( B x t + E z y E y z ) d t d y d z + ( B y t + E x z E z x ) d t d z d x (42.63) + ( B z t + E y x E x y ) d t d x d y = 0 2 d F ~ = B x x + B y y + B z z d x d y d z + B x t + E z y E y z d t d y d z + B y t + E x z E z x d t d z d x (42.63) + B z t + E y x E x y d t d x d y = 0 {:[2d tilde(F)=((delB^(x))/(del x)+(delB^(y))/(del y)+(delB^(z))/(del z))dx^^dy^^dz],[+((delB^(x))/(del t)+(delE^(z))/(del y)*(delE^(y))/(del z))dt^^dy^^dz],[+((delB^(y))/(del t)+(delE^(x))/(del z)-(delE^(z))/(del x))dt^^dz^^dx],[(42.63)+((delB^(z))/(del t)+(delE^(y))/(del x)-(delE^(x))/(del y))dt^^dx^^dy=0]:}\begin{align*} 2 \boldsymbol{d} \tilde{\boldsymbol{F}}= & \left(\frac{\partial B^{x}}{\partial x}+\frac{\partial B^{y}}{\partial y}+\frac{\partial B^{z}}{\partial z}\right) \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +\left(\frac{\partial B^{x}}{\partial t}+\frac{\partial E^{z}}{\partial y} \cdot \frac{\partial E^{y}}{\partial z}\right) \boldsymbol{d} t \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +\left(\frac{\partial B^{y}}{\partial t}+\frac{\partial E^{x}}{\partial z}-\frac{\partial E^{z}}{\partial x}\right) \boldsymbol{d} t \wedge \boldsymbol{d} z \wedge \boldsymbol{d} x \\ & +\left(\frac{\partial B^{z}}{\partial t}+\frac{\partial E^{y}}{\partial x}-\frac{\partial E^{x}}{\partial y}\right) \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y=0 \tag{42.63} \end{align*}2dF~=(Bxx+Byy+Bzz)dxdydz+(Bxt+EzyEyz)dtdydz+(Byt+ExzEzx)dtdzdx(42.63)+(Bzt+EyxExy)dtdxdy=0
Set each of the coefficients to zero, to guarantee that the expression always vanishes, and we extract
B = 0 , × E = B t . B = 0 , × E = B t . vec(grad)* vec(B)=0,quad vec(grad)xx vec(E)=-(del( vec(B)))/(del t).\vec{\nabla} \cdot \vec{B}=0, \quad \vec{\nabla} \times \vec{E}=-\frac{\partial \vec{B}}{\partial t} .B=0,×E=Bt.
Another way to express the sum of components in d F ~ = 0 d F ~ = 0 d tilde(F)=0\boldsymbol{d} \tilde{\boldsymbol{F}}=0dF~=0 is to note that, owing to the properties of the wedge product, any term with a repeated index in eqn 42.62 vanishes. As a result, we must have that the sum of the F α β / x γ F α β / x γ delF_(alpha beta)//delx^(gamma)\partial F_{\alpha \beta} / \partial x^{\gamma}Fαβ/xγ with indices all different vanishes. We therefore write an equivalent version of the first two Maxwell equations as
(42.65) F α β x γ + F γ α x β + F β γ x α = 0 (42.65) F α β x γ + F γ α x β + F β γ x α = 0 {:(42.65)(delF_(alpha beta))/(delx^(gamma))+(delF_(gamma alpha))/(delx^(beta))+(delF_(beta gamma))/(delx^(alpha))=0:}\begin{equation*} \frac{\partial F_{\alpha \beta}}{\partial x^{\gamma}}+\frac{\partial F_{\gamma \alpha}}{\partial x^{\beta}}+\frac{\partial F_{\beta \gamma}}{\partial x^{\alpha}}=0 \tag{42.65} \end{equation*}(42.65)Fαβxγ+Fγαxβ+Fβγxα=0
which is the Bianchi identity for electromagnetism that we gave in eqn 42.24. We'll examine its physical and geometrical significance further in the next chapter.
In order to recreate the other two Maxwell equations, we must find out how the current J J J\boldsymbol{J}J is coupled to the electromagnetic fields. The way to couple the fields is not to use F F F\boldsymbol{F}F, but the 2-form that results from taking the dual of the ( 2 , 0 ) ( 2 , 0 ) (2,0)(2,0)(2,0) tensor F F F\boldsymbol{F}F, which is 27 27 ^(27){ }^{27}27
(42.66) F = 1 2 ( F ) α β d x α d x β . (42.66) F = 1 2 ( F ) α β d x α d x β . {:(42.66)***F=(1)/(2)(***F)_(alpha beta)dx^(alpha)^^dx^(beta).:}\begin{equation*} \star \boldsymbol{F}=\frac{1}{2}(\star \boldsymbol{F})_{\alpha \beta} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} . \tag{42.66} \end{equation*}(42.66)F=12(F)αβdxαdxβ.
27 27 ^(27){ }^{27}27 See Chapter 37 for the recipe for forming duals. An alternative method to the one used here is to take the dual of the ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2) Faraday 2 -form F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ to make the ( 2 , 0 ) ( 2 , 0 ) (2,0)(2,0)(2,0) Maxwell tensor F F ***F\star \boldsymbol{F}F. We would then need to lower the indices to construct the components of the 2 -form.
The object F F ***F\star \boldsymbol{F}F is known as the Maxwell tensor.
Example 42.11
The dual of the (2,0) Faraday tensor F = 1 2 F μ ν ( e μ e ν ) F = 1 2 F μ ν e μ e ν F=(1)/(2)F^(mu nu)(e_(mu)^^e_(nu))\boldsymbol{F}=\frac{1}{2} F^{\mu \nu}\left(\boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\nu}\right)F=12Fμν(eμeν) is a 2-form with components
(42.67) ( F ) α β 1 2 ε μ ν α β F μ ν = ε | μ ν | α β F μ ν . (42.67) ( F ) α β 1 2 ε μ ν α β F μ ν = ε | μ ν | α β F μ ν . {:(42.67)(***F)_(alpha beta)-=(1)/(2)*epsi_(mu nu alpha beta)F^(mu nu)=epsi_(|mu nu|alpha beta)F^(mu nu).:}\begin{equation*} (\star \boldsymbol{F})_{\alpha \beta} \equiv \frac{1}{2} \cdot \varepsilon_{\mu \nu \alpha \beta} F^{\mu \nu}=\varepsilon_{|\mu \nu| \alpha \beta} F^{\mu \nu} . \tag{42.67} \end{equation*}(42.67)(F)αβ12εμναβFμν=ε|μν|αβFμν.
We obtain
( F ) 01 = ε 2301 F 23 = B x , ( F ) 02 = ε 1302 F 13 = B y , ( F ) 03 = ε 1203 F 12 = B z , ( F ) 23 = ε 0123 F 01 = E x , ( F ) 13 = ε 0213 F 02 = E y , (42.68) ( F ) 12 = ε 0312 F 03 = E z . ( F ) 01 = ε 2301 F 23 = B x , ( F ) 02 = ε 1302 F 13 = B y , ( F ) 03 = ε 1203 F 12 = B z , ( F ) 23 = ε 0123 F 01 = E x , ( F ) 13 = ε 0213 F 02 = E y , (42.68) ( F ) 12 = ε 0312 F 03 = E z . {:[(***F)_(01)=epsi_(2301)F^(23)=B^(x)","],[(***F)_(02)=epsi_(1302)F^(13)=B^(y)","],[(***F)_(03)=epsi_(1203)F^(12)=B^(z)","],[(***F)_(23)=epsi_(0123)F^(01)=E^(x)","],[(***F)_(13)=epsi_(0213)F^(02)=-E^(y)","],[(42.68)(***F)_(12)=epsi_(0312)F^(03)=E^(z).]:}\begin{align*} & (\star \boldsymbol{F})_{01}=\varepsilon_{2301} F^{23}=B^{x}, \\ & (\star \boldsymbol{F})_{02}=\varepsilon_{1302} F^{13}=B^{y}, \\ & (\star \boldsymbol{F})_{03}=\varepsilon_{1203} F^{12}=B^{z}, \\ & (\star \boldsymbol{F})_{23}=\varepsilon_{0123} F^{01}=E^{x}, \\ & (\star \boldsymbol{F})_{13}=\varepsilon_{0213} F^{02}=-E^{y}, \\ & (\star \boldsymbol{F})_{12}=\varepsilon_{0312} F^{03}=E^{z} . \tag{42.68} \end{align*}(F)01=ε2301F23=Bx,(F)02=ε1302F13=By,(F)03=ε1203F12=Bz,(F)23=ε0123F01=Ex,(F)13=ε0213F02=Ey,(42.68)(F)12=ε0312F03=Ez.
This 2-form has components 28 28 ^(28){ }^{28}28
( F ) μ ν = ( 0 B x B y B z B x 0 E z E y B y E z 0 E x B z E y E x 0 ) ( F ) μ ν = 0 B x B y B z B x 0 E z E y B y E z 0 E x B z E y E x 0 (***F)_(mu nu)=([0,B^(x),B^(y),B^(z)],[-B^(x),0,E^(z),-E^(y)],[-B^(y),-E^(z),0,E^(x)],[-B^(z),E^(y),-E^(x),0])(\star \boldsymbol{F})_{\mu \nu}=\left(\begin{array}{cccc} 0 & B^{x} & B^{y} & B^{z} \\ -B^{x} & 0 & E^{z} & -E^{y} \\ -B^{y} & -E^{z} & 0 & E^{x} \\ -B^{z} & E^{y} & -E^{x} & 0 \end{array}\right)(F)μν=(0BxByBzBx0EzEyByEz0ExBzEyEx0)
(42.69)
So the change from the 2 -form F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ to the 2 -form F F ***F\star \boldsymbol{F}F can be achieved by making a transformation of components corresponding to E E vec(E)\vec{E}E changing to B B - vec(B)-\vec{B}B and B B vec(B)\vec{B}B changing to E E vec(E)\vec{E}E. It is also straightforward to show that F = F F = F ******F=-F\star \star \boldsymbol{F}=-\boldsymbol{F}F=F.
The Maxwell 2-form is written in terms of the 3 -vector E E vec(E)\vec{E}E and B B vec(B)\vec{B}B fields as
F = E x d y d z + E y d z d x + E z d x d y (42.70) + B x d t d x + B y d t d y + B z d t d z F = E x d y d z + E y d z d x + E z d x d y (42.70) + B x d t d x + B y d t d y + B z d t d z {:[***F=E^(x)dy^^dz+E^(y)dz^^dx+E^(z)dx^^dy],[(42.70)+B^(x)dt^^dx+B^(y)dt^^dy+B^(z)dt^^dz]:}\begin{align*} \star \boldsymbol{F}= & E^{x} \boldsymbol{d} y \wedge \boldsymbol{d} z+E^{y} \boldsymbol{d} z \wedge \boldsymbol{d} x+E^{z} \boldsymbol{d} x \wedge \boldsymbol{d} y \\ & +B^{x} \boldsymbol{d} t \wedge \boldsymbol{d} x+B^{y} \boldsymbol{d} t \wedge \boldsymbol{d} y+B^{z} \boldsymbol{d} t \wedge \boldsymbol{d} z \tag{42.70} \end{align*}F=Exdydz+Eydzdx+Ezdxdy(42.70)+Bxdtdx+Bydtdy+Bzdtdz
We shall see in the next chapter that the conservation of charge that is so fundamental to electromagnetism is expressed by the dual of the current J J ***J\star \boldsymbol{J}J. It is this ( 0 , 3 ) ( 0 , 3 ) (0,3)(0,3)(0,3) tensor that couples to the 2 -form F F ***F\star \boldsymbol{F}F to give the second pair of Maxwell's equations.

Example 42.12

Let's take the dual of the current vector J J J\boldsymbol{J}J. In n = 4 n = 4 n=4n=4n=4 dimensions, this results in a 3 -form with components J α β γ = ε μ α β γ J μ J α β γ = ε μ α β γ J μ J_(alpha beta gamma)=epsi_(mu alpha beta gamma)J^(mu)J_{\alpha \beta \gamma}=\varepsilon_{\mu \alpha \beta \gamma} J^{\mu}Jαβγ=εμαβγJμ given by
( J ) 123 = ε 0123 J 0 = ρ ( J ) 023 = ε 1023 J 1 = J x ( J ) 031 = ε 2031 J 2 = J y (42.71) ( J ) 012 = ε 3012 J 3 = J z ( J ) 123 = ε 0123 J 0 = ρ ( J ) 023 = ε 1023 J 1 = J x ( J ) 031 = ε 2031 J 2 = J y (42.71) ( J ) 012 = ε 3012 J 3 = J z {:[(***J)_(123)=epsi_(0123)J^(0)=rho],[(***J)_(023)=epsi_(1023)J^(1)=-J^(x)],[(***J)_(031)=epsi_(2031)J^(2)=-J^(y)],[(42.71)(***J)_(012)=epsi_(3012)J^(3)=-J^(z)]:}\begin{align*} & (\star J)_{123}=\varepsilon_{0123} J^{0}=\rho \\ & (\star J)_{023}=\varepsilon_{1023} J^{1}=-J^{x} \\ & (\star J)_{031}=\varepsilon_{2031} J^{2}=-J^{y} \\ & (\star J)_{012}=\varepsilon_{3012} J^{3}=-J^{z} \tag{42.71} \end{align*}(J)123=ε0123J0=ρ(J)023=ε1023J1=Jx(J)031=ε2031J2=Jy(42.71)(J)012=ε3012J3=Jz
So we have the 3 -form
J = ρ d x d y d z J x d t d y d z (42.72) J y d t d z d x J z d t d x d y . J = ρ d x d y d z J x d t d y d z (42.72) J y d t d z d x J z d t d x d y . {:[***J=rho dx^^dy^^dz-J^(x)dt^^dy^^dz],[(42.72)-J^(y)dt^^dz^^dx-J^(z)dt^^dx^^dy.]:}\begin{align*} \star \boldsymbol{J}= & \rho \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z-J^{x} \boldsymbol{d} t \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & -J^{y} \boldsymbol{d} t \wedge \boldsymbol{d} z \wedge \boldsymbol{d} x-J^{z} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y . \tag{42.72} \end{align*}J=ρdxdydzJxdtdydz(42.72)JydtdzdxJzdtdxdy.
The final piece of the puzzle is to determine how to couple the 2 -form F F ***F\star \boldsymbol{F}F to the 3 -form J J ***J\star \boldsymbol{J}J. The answer is to make another 3 -form by taking the exterior derivative of the Maxwell 2-form 29 F 29 F ^(29)***F{ }^{29} \star \boldsymbol{F}29F.
Example 42.13
Take the exterior derivative of the Maxwell 2-form to make a 3-form
d F = ( E x x + E y y + E z z ) d x d y d z + ( E x t + B z y B y z ) d t d y d z + ( E y t + B x z B z x ) d t d z d x (42.73) + ( E z t + B y x B x y ) d t d x d y d F = E x x + E y y + E z z d x d y d z + E x t + B z y B y z d t d y d z + E y t + B x z B z x d t d z d x (42.73) + E z t + B y x B x y d t d x d y {:[d***F=((delE_(x))/(del x)+(delE_(y))/(del y)+(delE_(z))/(del z))dx^^dy^^dz],[+((delE_(x))/(del t)+(delB_(z))/(del y)-(delB_(y))/(del z))dt^^dy^^dz],[+((delE_(y))/(del t)+(delB_(x))/(del z)-(delB_(z))/(del x))dt^^dz^^dx],[(42.73)+((delE_(z))/(del t)+(delB_(y))/(del x)-(delB_(x))/(del y))dt^^dx^^dy]:}\begin{align*} \boldsymbol{d} \star \boldsymbol{F}= & \left(\frac{\partial E_{x}}{\partial x}+\frac{\partial E_{y}}{\partial y}+\frac{\partial E_{z}}{\partial z}\right) \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +\left(\frac{\partial E_{x}}{\partial t}+\frac{\partial B_{z}}{\partial y}-\frac{\partial B_{y}}{\partial z}\right) \boldsymbol{d} t \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & +\left(\frac{\partial E_{y}}{\partial t}+\frac{\partial B_{x}}{\partial z}-\frac{\partial B_{z}}{\partial x}\right) \boldsymbol{d} t \wedge \boldsymbol{d} z \wedge \boldsymbol{d} x \\ & +\left(\frac{\partial E_{z}}{\partial t}+\frac{\partial B_{y}}{\partial x}-\frac{\partial B_{x}}{\partial y}\right) \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \tag{42.73} \end{align*}dF=(Exx+Eyy+Ezz)dxdydz+(Ext+BzyByz)dtdydz+(Eyt+BxzBzx)dtdzdx(42.73)+(Ezt+ByxBxy)dtdxdy
28 28 ^(28){ }^{28}28 The corresponding (2,0) tensor has components
Match each component with that of the 3 -form J J ***J\star \boldsymbol{J}J
J = ρ d x d y d z J x d t d y d z (42.74) J y d t d z d x J z d t d x d y . J = ρ d x d y d z J x d t d y d z (42.74) J y d t d z d x J z d t d x d y . {:[***J=rho dx^^dy^^dz-J^(x)dt^^dy^^dz],[(42.74)-J^(y)dt^^dz^^dx-J^(z)dt^^dx^^dy.]:}\begin{align*} \star \boldsymbol{J}= & \rho \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z-J^{x} \boldsymbol{d} t \wedge \boldsymbol{d} y \wedge \boldsymbol{d} z \\ & -J^{y} \boldsymbol{d} t \wedge \boldsymbol{d} z \wedge \boldsymbol{d} x-J^{z} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y . \tag{42.74} \end{align*}J=ρdxdydzJxdtdydz(42.74)JydtdzdxJzdtdxdy.
This results in the remaining Maxwell equations.
We conclude that the second two Maxwell equations are given by
(42.75) d F = J (42.75) d F = J {:(42.75)d***F=***J:}\begin{equation*} \boldsymbol{d} \star \boldsymbol{F}=\star \boldsymbol{J} \tag{42.75} \end{equation*}(42.75)dF=J
We have now built a geometrical version of the Maxwell equations. In the next chapter, we shall look a little more closely at the notion of charge conservation and see how this allows us to understand the Maxwell equations in their geometric form.

Chapter summary

  • Interactions of changes with electromagnetic fields are described in flat space by the equation of motion
(42.76) d p ~ d τ = q F ~ ( , u ) (42.76) d p ~ d τ = q F ~ ( , u ) {:(42.76)(d( tilde(p)))/((d)tau)=q tilde(F)(","u):}\begin{equation*} \frac{\mathrm{d} \tilde{\boldsymbol{p}}}{\mathrm{~d} \tau}=q \tilde{\boldsymbol{F}}(, \boldsymbol{u}) \tag{42.76} \end{equation*}(42.76)dp~ dτ=qF~(,u)
  • The Faraday tensor is the 2 -form F ~ = d A ~ F ~ = d A ~ tilde(F)=d tilde(A)\tilde{\boldsymbol{F}}=\boldsymbol{d} \tilde{\boldsymbol{A}}F~=dA~ where A ~ A ~ tilde(A)\tilde{\boldsymbol{A}}A~ is the gauge field.
  • In coordinate free-notation, the Maxwell equations can be expressed as
(42.77) d F ~ = 0 , d F = J (42.77) d F ~ = 0 , d F = J {:(42.77)d tilde(F)=0","quad d***F=***J:}\begin{equation*} \boldsymbol{d} \tilde{\boldsymbol{F}}=0, \quad \boldsymbol{d} \star \boldsymbol{F}=\star \boldsymbol{J} \tag{42.77} \end{equation*}(42.77)dF~=0,dF=J
  • The ( 0 , 2 ) ( 0 , 2 ) (0,2)(0,2)(0,2) energy-momentum tensor T ( T ( T(\boldsymbol{T}(T(, ) f o r t h e e l e c t r o m a g n e t i c ) f o r t h e e l e c t r o m a g n e t i c )fortheelectromagnetic) for the electromagnetic)fortheelectromagnetic field has components
(42.78) T μ ν = F μ α F ν β g α β 1 4 g μ ν F γ δ F λ ρ g γ λ g δ ρ (42.78) T μ ν = F μ α F ν β g α β 1 4 g μ ν F γ δ F λ ρ g γ λ g δ ρ {:(42.78)T_(mu nu)=F_(mu alpha)F_(nu beta)g^(alpha beta)-(1)/(4)g_(mu nu)F_(gamma delta)F_(lambda rho)g^(gamma lambda)g^(delta rho):}\begin{equation*} T_{\mu \nu}=F_{\mu \alpha} F_{\nu \beta} g^{\alpha \beta}-\frac{1}{4} g_{\mu \nu} F_{\gamma \delta} F_{\lambda \rho} g^{\gamma \lambda} g^{\delta \rho} \tag{42.78} \end{equation*}(42.78)Tμν=FμαFνβgαβ14gμνFγδFλρgγλgδρ

Exercises

(42.1) Show that the equation of motion in eqn 42.12 is consistent with the Lorentz force law.
(42.2) Use the fact that under a Lorentz transformation
(42.79) F μ ν ( F ) μ ν = Λ α μ Λ β ν F α β (42.79) F μ ν F μ ν = Λ α μ Λ β ν F α β {:(42.79)F^(mu nu)rarr(F^('))^(mu nu)=Lambda_(alpha)^(mu)Lambda_(beta)^(nu)F^(alpha beta):}\begin{equation*} F^{\mu \nu} \rightarrow\left(F^{\prime}\right)^{\mu \nu}=\Lambda_{\alpha}^{\mu} \Lambda_{\beta}^{\nu} F^{\alpha \beta} \tag{42.79} \end{equation*}(42.79)Fμν(F)μν=ΛαμΛβνFαβ
to show how the components of E E vec(E)\vec{E}E and B B vec(B)\vec{B}B transform when a boost is made along the x x xxx axis.
(42.3) (a) Why in electronics are we free to set the zero of electric potential? Are we allowed to set zero volts differently in London and in Manchester? (b) What is the effect on the equations for the E E vec(E)\vec{E}E -
and B B vec(B)\vec{B}B-fields of making the changes to the components A μ = ( V , A ) A μ = ( V , A ) A^(mu)=(V, vec(A))A^{\mu}=(V, \vec{A})Aμ=(V,A) of the electromagnetic field A ( x ) A ( x ) A(x)\boldsymbol{A}(x)A(x) of
V V χ t (42.80) A A + χ V V χ t (42.80) A A + χ {:[V rarr V-(del chi)/(del t)],[(42.80) vec(A)rarr vec(A)+ vec(grad)chi]:}\begin{align*} & V \rightarrow V-\frac{\partial \chi}{\partial t} \\ & \vec{A} \rightarrow \vec{A}+\vec{\nabla} \chi \tag{42.80} \end{align*}VVχt(42.80)AA+χ
where χ χ chi\chiχ is a function of space and time coordinates?
(c) Consider Maxwell's equations given in terms of the field A A A\boldsymbol{A}A. What happens to the equations if we choose V V VVV and A A vec(A)\vec{A}A such that
(42.81) A + V t = 0 (42.81) A + V t = 0 {:(42.81) vec(grad)* vec(A)+(del V)/(del t)=0:}\begin{equation*} \vec{\nabla} \cdot \vec{A}+\frac{\partial V}{\partial t}=0 \tag{42.81} \end{equation*}(42.81)A+Vt=0
(42.4) (a) In flat space, what B B vec(B)\vec{B}B-field results from a vector potential with components A = ( 0 , C x , 0 ) A = ( 0 , C x , 0 ) vec(A)=(0,Cx,0)\vec{A}=(0, C x, 0)A=(0,Cx,0), where C C CCC is a constant?
This is known as Landau gauge.
(b) What B B vec(B)\vec{B}B-field results from a vector potential A = 1 2 ( C y , C x , 0 ) A = 1 2 ( C y , C x , 0 ) vec(A)=(1)/(2)(-Cy,Cx,0)\vec{A}=\frac{1}{2}(-C y, C x, 0)A=12(Cy,Cx,0), where C C CCC is a constant?
(42.5) (a) Show that F α β , γ + F γ α , β + F β γ , α = 0 F α β , γ + F γ α , β + F β γ , α = 0 F_(alpha beta,gamma)+F_(gamma alpha,beta)+F_(beta gamma,alpha)=0F_{\alpha \beta, \gamma}+F_{\gamma \alpha, \beta}+F_{\beta \gamma, \alpha}=0Fαβ,γ+Fγα,β+Fβγ,α=0, yields two Maxwell equations.
(b) Show that F μ ν , ν = J μ F μ ν , ν = J μ F^(mu nu)_(,nu)=J^(mu)F^{\mu \nu}{ }_{, \nu}=J^{\mu}Fμν,ν=Jμ yields the other two.
(42.6) Derive the expressions linking the components of T T TTT to the E E vec(E)\vec{E}E and B B vec(B)\vec{B}B fields.
(42.7) By considering the spatial Laplacian of a component 2 A μ 2 A μ vec(grad)^(2)A^(mu)\vec{\nabla}^{2} A^{\mu}2Aμ, show that the retarded potential in eqn 42.50 satisfies the inhomogeneous wave equation 2 A μ ( t , x μ ) = J μ ( t , x μ ) 2 A μ t , x μ = J μ t , x μ del^(2)A^(mu)(t,x^(mu))=-J^(mu)(t,x^(mu))\partial^{2} A^{\mu}\left(t, x^{\mu}\right)=-J^{\mu}\left(t, x^{\mu}\right)2Aμ(t,xμ)=Jμ(t,xμ).
Hint: Use the identities R = R ^ , ( 1 / R ) = R = R ^ , ( 1 / R ) = vec(grad)R= hat(vec(R)), vec(grad)(1//R)=\vec{\nabla} R=\hat{\vec{R}}, \vec{\nabla}(1 / R)=R=R^,(1/R)= R ^ / R 2 , ( R ^ / R ) = 1 / R 2 R ^ / R 2 , ( R ^ / R ) = 1 / R 2 - hat(vec(R))//R^(2), vec(grad)*( hat(vec(R))//R)=1//R^(2)-\hat{\vec{R}} / R^{2}, \vec{\nabla} \cdot(\hat{\vec{R}} / R)=1 / R^{2}R^/R2,(R^/R)=1/R2 and ( R ^ / R 2 ) = R ^ / R 2 = vec(grad)*(( hat(vec(R)))//R^(2))=\vec{\nabla} \cdot\left(\hat{\vec{R}} / R^{2}\right)=(R^/R2)= 4 π δ ( 3 ) ( R ) 4 π δ ( 3 ) ( R ) 4pidelta^((3))( vec(R))4 \pi \delta^{(3)}(\vec{R})4πδ(3)(R), where R = | x x | R = x x R=|( vec(x))- vec(x)^(')|R=\left|\vec{x}-\vec{x}^{\prime}\right|R=|xx|.
(42.8) We can show that eqn 42.50 transforms as a 4 vector in flat space. Consider field A μ A μ A^(mu)A^{\mu}Aμ at point P P P\mathcal{P}P at ( 0 , 0 , 0 , 0 ) ( 0 , 0 , 0 , 0 ) (0,0,0,0)(0,0,0,0)(0,0,0,0) resulting from light travelling from the current J μ J μ J^(mu)J^{\mu}Jμ at points Q Q Q\mathcal{Q}Q at ( t , x , y , z ) ( t , x , y , z ) (-t,x,y,z)(-t, x, y, z)(t,x,y,z). We have
(42.82) A μ ( P ) = 1 4 π d V J μ ( Q ) | r | (42.82) A μ ( P ) = 1 4 π d V J μ ( Q ) | r | {:(42.82)A^(mu)(P)=(1)/(4pi)intdV(J^(mu)(Q))/(|( vec(r))|):}\begin{equation*} A^{\mu}(\mathcal{P})=\frac{1}{4 \pi} \int \mathrm{~d} \mathcal{V} \frac{J^{\mu}(\mathcal{Q})}{|\vec{r}|} \tag{42.82} \end{equation*}(42.82)Aμ(P)=14π dVJμ(Q)|r|
where r 2 = x 2 + y 2 + z 2 , | r | = t r 2 = x 2 + y 2 + z 2 , | r | = t r^(2)=x^(2)+y^(2)+z^(2),| vec(r)|=-tr^{2}=x^{2}+y^{2}+z^{2},|\vec{r}|=-tr2=x2+y2+z2,|r|=t (and we drop the primed notation for the volume element d V d V dVd \mathcal{V}dV to avoid confusion with the primed coordinates examined in this problem!).
(a) In a primed frame moving at speed v v vvv along the x x xxx-direction relative to the unprimed frame, show that
(42.83) r = γ r ( 1 + v cos θ ) (42.83) r = γ r ( 1 + v cos θ ) {:(42.83)r^(')=gamma r(1+v cos theta):}\begin{equation*} r^{\prime}=\gamma r(1+v \cos \theta) \tag{42.83} \end{equation*}(42.83)r=γr(1+vcosθ)
where γ = ( 1 v ) 1 2 γ = ( 1 v ) 1 2 gamma=(1-v)^(-(1)/(2))\gamma=(1-v)^{-\frac{1}{2}}γ=(1v)12 and θ θ theta\thetaθ is the angle between r r vec(r)\vec{r}r and the x x xxx axis.
(b) The three-volume d V d V dV\mathrm{d} \mathcal{V}dV captures all of the events that intersect a spherical light wave launched backward in time from P P P\mathcal{P}P. Show that the volume element in the primed frame is given by d V = d V = dV^(')=\mathrm{d} \mathcal{V}^{\prime}=dV= d x d y d z = d x d y d z d x d y d z = d x d y d z dx^(')dy^(')dz^(')=dx^(')dydz\mathrm{d} x^{\prime} \mathrm{d} y^{\prime} \mathrm{d} z^{\prime}=\mathrm{d} x^{\prime} \mathrm{d} y \mathrm{~d} zdxdydz=dxdy dz, with
(42.84) d x = γ ( d x v d t ) (42.84) d x = γ ( d x v d t ) {:(42.84)dx^(')=gamma(dx-vdt):}\begin{equation*} \mathrm{d} x^{\prime}=\gamma(\mathrm{d} x-v \mathrm{~d} t) \tag{42.84} \end{equation*}(42.84)dx=γ(dxv dt)
(c) In the previous equation, d t d t dt\mathrm{d} tdt gives the difference in times for measuring the extremities of the interval d x d x dx\mathrm{d} xdx. Use this to show that
(42.85) d x = γ d x ( 1 + v cos θ ) (42.85) d x = γ d x ( 1 + v cos θ ) {:(42.85)dx^(')=gammadx(1+v cos theta):}\begin{equation*} \mathrm{d} x^{\prime}=\gamma \mathrm{d} x(1+v \cos \theta) \tag{42.85} \end{equation*}(42.85)dx=γdx(1+vcosθ)
and, as a result, that the eqn 42.50 transforms as a 4 -vector.
(42.9) Show that the solution to the equations of motion in eqn 42.57 are helices.
(42.10) We define projection tensors with components
(42.86) P L μ ν = p μ p ν p 2 and P T μ ν = g μ ν p μ p ν p 2 (42.86) P L μ ν = p μ p ν p 2  and  P T μ ν = g μ ν p μ p ν p 2 {:(42.86)P_(L)^(mu nu)=(p^(mu)p^(nu))/(p^(2))quad" and "quadP_(T)^(mu nu)=g^(mu nu)-(p^(mu)p^(nu))/(p^(2)):}\begin{equation*} P_{\mathrm{L}}^{\mu \nu}=\frac{p^{\mu} p^{\nu}}{p^{2}} \quad \text { and } \quad P_{\mathrm{T}}^{\mu \nu}=g^{\mu \nu}-\frac{p^{\mu} p^{\nu}}{p^{2}} \tag{42.86} \end{equation*}(42.86)PLμν=pμpνp2 and PTμν=gμνpμpνp2
where p μ p μ p^(mu)p^{\mu}pμ are the components of a momentum vector. Verify that P L P L P_(L)P_{\mathrm{L}}PL and P T P T P_(T)P_{\mathrm{T}}PT are indeed projection operators by showing that P 2 = P P 2 = P P^(2)=PP^{2}=PP2=P.
(42.11) The Lagrangian for electromagnetism in vacuo is, in Minkowski space, L = 1 4 F μ ν F μ ν L = 1 4 F μ ν F μ ν L=-(1)/(4)F^(mu nu)F_(mu nu)\mathcal{L}=-\frac{1}{4} F^{\mu \nu} F_{\mu \nu}L=14FμνFμν.
(a) Show that this can be rewritten as
(42.87) L = 1 2 ( μ A ν μ A ν μ A ν ν A μ ) (42.87) L = 1 2 μ A ν μ A ν μ A ν ν A μ {:(42.87)L=-(1)/(2)(del_(mu)A_(nu)del^(mu)A^(nu)-del_(mu)A_(nu)del^(nu)A^(mu)):}\begin{equation*} \mathcal{L}=-\frac{1}{2}\left(\partial_{\mu} A_{\nu} \partial^{\mu} A^{\nu}-\partial_{\mu} A_{\nu} \partial^{\nu} A^{\mu}\right) \tag{42.87} \end{equation*}(42.87)L=12(μAνμAνμAννAμ)
(b) We define a generalized momentum density with components Π σ ρ Π σ ρ Pi^(sigma rho)\Pi^{\sigma \rho}Πσρ. Show that
(42.88) Π σ ρ L ( σ A ρ ) = F σ ρ (42.88) Π σ ρ L σ A ρ = F σ ρ {:(42.88)Pi^(sigma rho)-=(delL)/(del(del_(sigma)A_(rho)))=-F^(sigma rho):}\begin{equation*} \Pi^{\sigma \rho} \equiv \frac{\partial \mathcal{L}}{\partial\left(\partial_{\sigma} A_{\rho}\right)}=-F^{\sigma \rho} \tag{42.88} \end{equation*}(42.88)ΠσρL(σAρ)=Fσρ
(c) Hence, show that the canonical energymomentum tensor
(42.89) S ν μ = Π μ σ ν A σ + δ μ ν L (42.89) S ν μ = Π μ σ ν A σ + δ μ ν L {:(42.89)S_(nu)^(mu)=-Pi^(mu sigma)del_(nu)A_(sigma)+delta^(mu)_(nu)L:}\begin{equation*} S_{\nu}^{\mu}=-\Pi^{\mu \sigma} \partial_{\nu} A_{\sigma}+\delta^{\mu}{ }_{\nu} \mathcal{L} \tag{42.89} \end{equation*}(42.89)Sνμ=ΠμσνAσ+δμνL
can be written as
(42.90) S μ ν = F μ σ ν A σ 1 4 η μ ν F α β F α β (42.90) S μ ν = F μ σ ν A σ 1 4 η μ ν F α β F α β {:(42.90)S^(mu nu)=F^(mu sigma)del^(nu)A_(sigma)-(1)/(4)eta^(mu nu)F^(alpha beta)F_(alpha beta):}\begin{equation*} S^{\mu \nu}=F^{\mu \sigma} \partial^{\nu} A_{\sigma}-\frac{1}{4} \eta^{\mu \nu} F^{\alpha \beta} F_{\alpha \beta} \tag{42.90} \end{equation*}(42.90)Sμν=FμσνAσ14ημνFαβFαβ
42.12) The canonical energy-momentum tensor from the last question is not symmetric (and so cannot be used for general relativity). However, we can symmetrize it by adding an extra term λ X λ μ ν λ X λ μ ν del_(lambda)X^(lambda mu nu)\partial_{\lambda} X^{\lambda \mu \nu}λXλμν, where X λ μ ν = F μ λ A ν X λ μ ν = F μ λ A ν X^(lambda mu nu)=-F^(mu lambda)A^(nu)X^{\lambda \mu \nu}=-F^{\mu \lambda} A^{\nu}Xλμν=FμλAν. Show that a symmetrized energy-momentum tensor T μ ν = S μ ν + λ X λ μ ν T μ ν = S μ ν + λ X λ μ ν T^(mu nu)=S^(mu nu)+del_(lambda)X^(lambda mu nu)T^{\mu \nu}=S^{\mu \nu}+\partial_{\lambda} X^{\lambda \mu \nu}Tμν=Sμν+λXλμν can be written as
(42.91) T μ ν = F μ σ F σ ν 1 4 η μ ν F α β F α β , (42.91) T μ ν = F μ σ F σ ν 1 4 η μ ν F α β F α β , {:(42.91)T^(mu nu)=F^(mu sigma)F_(sigma)^(nu)-(1)/(4)eta^(mu nu)F^(alpha beta)F_(alpha beta)",":}\begin{equation*} T^{\mu \nu}=F^{\mu \sigma} F_{\sigma}^{\nu}-\frac{1}{4} \eta^{\mu \nu} F^{\alpha \beta} F_{\alpha \beta}, \tag{42.91} \end{equation*}(42.91)Tμν=FμσFσν14ημνFαβFαβ,
as we had before.
(42.13) We can attempt to formulate a scalar field theory for gravitation using the same method we used for electromagnetism. We follow the approach of Padmanabhan.
Consider the action for the coupling between the scalar field ϕ ( x μ ) ϕ x μ phi(x^(mu))\phi\left(x^{\mu}\right)ϕ(xμ) and matter
(42.92) S = d τ ( m + λ ϕ ) ( η μ ν d x μ d τ d x ν d τ ) 1 2 (42.92) S = d τ ( m + λ ϕ ) η μ ν d x μ d τ d x ν d τ 1 2 {:(42.92)S=-intdtau(m+lambda phi)(-eta_(mu nu)(dx^(mu))/(dtau)*((d)x^(nu))/(dtau))^((1)/(2)):}\begin{equation*} S=-\int \mathrm{d} \tau(m+\lambda \phi)\left(-\eta_{\mu \nu} \frac{\mathrm{d} x^{\mu}}{\mathrm{d} \tau} \cdot \frac{\mathrm{~d} x^{\nu}}{\mathrm{d} \tau}\right)^{\frac{1}{2}} \tag{42.92} \end{equation*}(42.92)S=dτ(m+λϕ)(ημνdxμdτ dxνdτ)12
(a) Verify that this is a sensible suggestion by restoring factors of c c ccc, and expanding the La- (42.14) We can examine the equation of motion of the field grangian in the non-relativistic limit to O ( 1 / c 2 ) O 1 / c 2 O(1//c^(2))O\left(1 / c^{2}\right)O(1/c2). (b) Show that the equation of motion for a particle is given by
(42.93) d u μ d τ = λ ( m + λ ϕ ) ( ϕ , μ + u μ u σ ϕ , σ ) (42.93) d u μ d τ = λ ( m + λ ϕ ) ϕ , μ + u μ u σ ϕ , σ {:(42.93)(du_(mu))/(dtau)=-(lambda)/((m+lambda phi))(phi_(,mu)+u_(mu)u^(sigma)phi_(,sigma)):}\begin{equation*} \frac{\mathrm{d} u_{\mu}}{\mathrm{d} \tau}=-\frac{\lambda}{(m+\lambda \phi)}\left(\phi_{, \mu}+u_{\mu} u^{\sigma} \phi_{, \sigma}\right) \tag{42.93} \end{equation*}(42.93)duμdτ=λ(m+λϕ)(ϕ,μ+uμuσϕ,σ)
(c) If ϕ ϕ phi\phiϕ describes gravitation then the equation of motion must obey the principle of equivalence. By rescaling the field ϕ ϕ phi\phiϕ, show that action must take
the form
(42.94) S = d τ ( 1 + Φ ) ( η μ ν d x μ d τ d x ν d τ ) 1 2 (42.94) S = d τ ( 1 + Φ ) η μ ν d x μ d τ d x ν d τ 1 2 {:(42.94)S=-intdtau(1+Phi)(-eta_(mu nu)(dx^(mu))/(dtau)((d)x^(nu))/(dtau))^((1)/(2)):}\begin{equation*} S=-\int \mathrm{d} \tau(1+\Phi)\left(-\eta_{\mu \nu} \frac{\mathrm{d} x^{\mu}}{\mathrm{d} \tau} \frac{\mathrm{~d} x^{\nu}}{\mathrm{d} \tau}\right)^{\frac{1}{2}} \tag{42.94} \end{equation*}(42.94)S=dτ(1+Φ)(ημνdxμdτ dxνdτ)12
The latter equation implies that the effect of gravitation can be accounted for by a change in the metric η g η g eta rarr g\boldsymbol{\eta} \rightarrow \boldsymbol{g}ηg with
(42.95) g μ ν = ( 1 + Φ ) 2 η μ ν (42.95) g μ ν = ( 1 + Φ ) 2 η μ ν {:(42.95)g_(mu nu)=(1+Phi)^(2)eta_(mu nu):}\begin{equation*} g_{\mu \nu}=(1+\Phi)^{2} \eta_{\mu \nu} \tag{42.95} \end{equation*}(42.95)gμν=(1+Φ)2ημν Φ Φ Phi\PhiΦ from the previous question.
(a) Show that a Lagrangian density
(42.96) L = 1 8 π G ( μ Φ ) 2 ρ Φ (42.96) L = 1 8 π G μ Φ 2 ρ Φ {:(42.96)L=-(1)/(8pi G)(del_(mu)Phi)^(2)-rho Phi:}\begin{equation*} \mathcal{L}=-\frac{1}{8 \pi G}\left(\partial_{\mu} \Phi\right)^{2}-\rho \Phi \tag{42.96} \end{equation*}(42.96)L=18πG(μΦ)2ρΦ
where ρ ρ rho\rhoρ is the mass density of particles, leads to Poisson's equation for the gravitational field. (b) How must this Lagrangian be altered to take into account gravitational coupling to energy in addition to mass?

Charge conservation and the Bianchi identity

For I have wings equipped to fly

Up to the high vault of the sky.
Once these are harnessed, your swift mind
Views earth with loathing, far behind.
Boethius (c.480-c.524/6) The Consolation of Philosophy
The conservation of charge, when described using geometry, leads to a mathematical expression known as the Bianchi identity. In electromagnetism, a Bianchi identity generates two of the Maxwell equations: the equations of motion of the electromagnetic fields. In the case of gravitation, the analogous equation (also called a Bianchi identity) encodes conservation of energy, and provides the constraint that tells us how matter couples to the curvature of spacetime. In this chapter, we follow a geometrical path, originally beaten by Misner, Thorne, and Wheeler, from charge conservation to the electromagnetic and then gravitational Bianchi identity. 1 1 ^(1){ }^{1}1 The tools we need are mostly taken from Chapter 38 and the arguments in this chapter are mostly geometrical ones involving several computations. As a result, the chapter can be skipped on a first reading if you're happy to take the Bianchi identity on trust and without a physical or geometrical justification.

43.1 Conserving electric charge

Conservation of charge can be written as a divergence J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0. In flat spacetime, 2 2 ^(2){ }^{2}2 we can write this as the component equation 3 3 ^(3){ }^{3}3
(43.2) J , μ μ = J μ x μ = 0 (flat space) (43.2) J , μ μ = J μ x μ = 0  (flat space)  {:(43.2)J_(,mu)^(mu)=(delJ^(mu))/(delx^(mu))=0quad" (flat space) ":}\begin{equation*} J_{, \mu}^{\mu}=\frac{\partial J^{\mu}}{\partial x^{\mu}}=0 \quad \text { (flat space) } \tag{43.2} \end{equation*}(43.2)J,μμ=Jμxμ=0 (flat space) 
This expresses the fact that charge is locally conserved, so no spontaneous generation of charges can take place in a given 4 -volume V V V\mathcal{V}V. This is important in relativity. Global conservation of charge would mean that no charges can be created or destroyed, but would allow a charge to disappear at one end of the Universe and reappear at some arbitrary point in spacetime. This would allow superluminal transport of charges, and so is forbidden. We must therefore have the stronger condition of local charge conservation enshrined in eqn 43.2 .
Let's unpack 4 4 ^(4){ }^{4}4 eqn 43.2 a little more. We are free to integrate the conservation equation over the 4 -volume V V V\mathcal{V}V and then apply Stokes' theorem,

43.1 Conserving electric charge 43.2 Electromagnetic gauge field 469
43.3 Gravitational curvature 471 Chapter summary 475 Exercises 475
1 1 ^(1){ }^{1}1 Luigi Bianchi (1856-1928) rediscovered the identity for the Riemann tensor in 1902. This had (according to Tullio Levi-Civita) been discovered by Gregorio Ricci-Curbastro around 1889, who had supposedly forgotten about it. A contracted version (examined in the exercises) had been derived in 1880 by Aurel Voss (1845-1931).
2 2 ^(2){ }^{2}2 We work in flat spacetime for simplicity, but remember that our valid tensor equations also apply in curved space. 3 3 ^(3){ }^{3}3 In a more familiar form, we write this as the flat-space continuity equation
(43.1) ρ t + J = 0 (43.1) ρ t + J = 0 {:(43.1)(del rho)/(del t)+ vec(grad)* vec(J)=0:}\begin{equation*} \frac{\partial \rho}{\partial t}+\vec{\nabla} \cdot \vec{J}=0 \tag{43.1} \end{equation*}(43.1)ρt+J=0
4 4 ^(4){ }^{4}4 As a warm up, consider the integral over a volume V V VVV in 3 -space with a boundary S S SSS, of
(43.3) V d V ( ρ t + J ) = 0 (43.3) V d V ρ t + J = 0 {:(43.3)int_(V)dV((del rho)/(del t)+( vec(grad))*( vec(J)))=0:}\begin{equation*} \int_{V} \mathrm{~d} V\left(\frac{\partial \rho}{\partial t}+\vec{\nabla} \cdot \vec{J}\right)=0 \tag{43.3} \end{equation*}(43.3)V dV(ρt+J)=0
The first term gives us the rate of change of the total charge Q Q QQQ in the volume and, applying the divergence theorem, we obtain
(43.4) Q t + S d S J = 0 (43.4) Q t + S d S J = 0 {:(43.4)(del Q)/(del t)+int_(S)d vec(S)* vec(J)=0:}\begin{equation*} \frac{\partial Q}{\partial t}+\int_{S} \mathrm{~d} \vec{S} \cdot \vec{J}=0 \tag{43.4} \end{equation*}(43.4)Qt+S dSJ=0
Integrating this latter equation with respect to time, we have the result that the total change in the amount of charge in the volume V V V\mathcal{V}V is equal to the total charge that flows through the surface S S SSS. This is local conservation of charge.
to yield
(43.5) 0 = V J μ x μ d V = V J μ d Σ μ (43.5) 0 = V J μ x μ d V = V J μ d Σ μ {:(43.5)0=int_(V)(delJ^(mu))/(delx^(mu))dV=int_(delV)J^(mu)dSigma_(mu):}\begin{equation*} 0=\int_{\mathcal{V}} \frac{\partial J^{\mu}}{\partial x^{\mu}} \mathrm{d} \mathcal{V}=\int_{\partial \mathcal{V}} J^{\mu} \mathrm{d} \Sigma_{\mu} \tag{43.5} \end{equation*}(43.5)0=VJμxμdV=VJμdΣμ
where V V delV\partial \mathcal{V}V is the boundary of the 4 -volume and d Σ μ d Σ μ dSigma_(mu)\mathrm{d} \Sigma_{\mu}dΣμ is a 3 -volume element. The right-hand side of eqn 43.5 is a statement that conservation means that the integral of currents across a boundary V V delV\partial \mathcal{V}V is zero. Charges enter and exit a given volume, but no new charge is generated in the volume V V V\mathcal{V}V. We shall now see how this fact is accounted for naturally in geometry.
The current of electric charge J J J\boldsymbol{J}J is the source of the electromagnetic field tensor F F F\boldsymbol{F}F. The charges create the fields, the fields tell the charges how to move. The link between the source J J J\boldsymbol{J}J and the field F F F\boldsymbol{F}F have, hard-wired into them, the fact that the source is conserved. This follows from the geometrical law we met in Chapter 38 that the boundary of a boundary is zero. To see this, we need to work with a slightly different version of the conservation law. Rather than J = 0 J = 0 grad*J=0\boldsymbol{\nabla} \cdot \boldsymbol{J}=0J=0, we shall use the equivalent version d J = 0 d J = 0 d***J=0\boldsymbol{d} \star \boldsymbol{J}=0dJ=0. We explain the origin and meaning of this expression below. We start by extracting the dual J J ***J\star \boldsymbol{J}J, which will be the flux in the new version of our conservation law.
Example 43.1
In flat (3+1)-dimensional spacetime, the vector J J JJJ has components J μ = J μ = J^(mu)=J^{\mu}=Jμ= ( ρ , J x , J y , J z ) ρ , J x , J y , J z (rho,J^(x),J^(y),J^(z))\left(\rho, J^{x}, J^{y}, J^{z}\right)(ρ,Jx,Jy,Jz) and so we have a dual 3 -form J J ***J\star \boldsymbol{J}J, with components
(43.6) ( J ) ν σ ρ = ε μ ν σ ρ J μ (43.6) ( J ) ν σ ρ = ε μ ν σ ρ J μ {:(43.6)(***J)_(nu sigma rho)=epsi_(mu nu sigma rho)J^(mu):}\begin{equation*} (\star J)_{\nu \sigma \rho}=\varepsilon_{\mu \nu \sigma \rho} J^{\mu} \tag{43.6} \end{equation*}(43.6)(J)νσρ=εμνσρJμ
For example, as we saw in the last chapter,
(43.7) ( J ) 012 d t d x d y = ε 3012 J z d t d x d y (43.7) ( J ) 012 d t d x d y = ε 3012 J z d t d x d y {:(43.7)(***J)_(012)dt^^dx^^dy=epsi_(3012)J^(z)dt^^dx^^dy:}\begin{equation*} (\star \boldsymbol{J})_{012} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y=\varepsilon_{3012} J^{z} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \tag{43.7} \end{equation*}(43.7)(J)012dtdxdy=ε3012Jzdtdxdy
5 5 ^(5){ }^{5}5 We saw in Chapter 38 that the 3volume element can be written as the 3-form
d Σ ~ μ = ε μ α β γ d x α d x β d x γ , = 1 3 ! ε μ α β γ d x α d x β d x γ . d Σ ~ μ = ε μ α β γ d x α d x β d x γ , = 1 3 ! ε μ α β γ d x α d x β d x γ . {:[d tilde(Sigma)_(mu)=epsi_(mu∣alpha beta gamma)∣dx^(alpha)^^dx^(beta)^^dx^(gamma)","],[=(1)/(3!)*epsi_(mu alpha beta gamma)dx^(alpha)^^dx^(beta)^^dx^(gamma).]:}\begin{aligned} \mathrm{d} \tilde{\boldsymbol{\Sigma}}_{\mu} & =\varepsilon_{\mu \mid \alpha \beta \gamma} \mid \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma}, \\ & =\frac{1}{3!} \cdot \varepsilon_{\mu \alpha \beta \gamma} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} .\end{aligned}dΣ~μ=εμαβγdxαdxβdxγ,=13!εμαβγdxαdxβdxγ.
Explicitly, we have
J = ( J ) 123 d x 1 d x 2 d x 3 + ( J ) 023 d x 0 d x 2 d x 3 + ( J ) 031 d x 0 d x 3 d x 1 + ( J ) 012 d x 0 d x 1 d x 2 = ρ d x 1 d x 2 d x 3 J 1 d x 0 d x 2 d x 3 J = ( J ) 123 d x 1 d x 2 d x 3 + ( J ) 023 d x 0 d x 2 d x 3 + ( J ) 031 d x 0 d x 3 d x 1 + ( J ) 012 d x 0 d x 1 d x 2 = ρ d x 1 d x 2 d x 3 J 1 d x 0 d x 2 d x 3 {:[***J=(***J)_(123)dx^(1)^^dx^(2)^^dx^(3)+(***J)_(023)dx^(0)^^dx^(2)^^dx^(3)],[+(***J)_(031)dx^(0)^^dx^(3)^^dx^(1)+(***J)_(012)dx^(0)^^dx^(1)^^dx^(2)],[=rho dx^(1)^^dx^(2)^^dx^(3)-J^(1)dx^(0)^^dx^(2)^^dx^(3)]:}\begin{aligned} \star \boldsymbol{J}= & (\star \boldsymbol{J})_{123} \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}+(\star \boldsymbol{J})_{023} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3} \\ & +(\star \boldsymbol{J})_{031} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{3} \wedge \boldsymbol{d} x^{1}+(\star \boldsymbol{J})_{012} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \\ = & \rho \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}-J^{1} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3} \end{aligned}J=(J)123dx1dx2dx3+(J)023dx0dx2dx3+(J)031dx0dx3dx1+(J)012dx0dx1dx2=ρdx1dx2dx3J1dx0dx2dx3
(43.8) + J 2 d x 0 d x 1 d x 3 J 3 d x 0 d x 1 d x 2 (43.8) + J 2 d x 0 d x 1 d x 3 J 3 d x 0 d x 1 d x 2 {:(43.8)+J^(2)dx^(0)^^dx^(1)^^dx^(3)-J^(3)dx^(0)^^dx^(1)^^dx^(2):}\begin{equation*} +J^{2} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{3}-J^{3} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \tag{43.8} \end{equation*}(43.8)+J2dx0dx1dx3J3dx0dx1dx2
It is simple to verify that the dual of the source field J J JJJ can also be written as 5 5 ^(5){ }^{5}5
(43.9) J = J μ d Σ ~ μ = J μ ε μ | α β γ | d x α d x β d x γ (43.9) J = J μ d Σ ~ μ = J μ ε μ | α β γ | d x α d x β d x γ {:(43.9)***J=J^(mu)d tilde(Sigma)_(mu)=J^(mu)epsi_(mu|alpha beta gamma|)dx^(alpha)^^dx^(beta)^^dx^(gamma):}\begin{equation*} \star \boldsymbol{J}=J^{\mu} \mathrm{d} \tilde{\boldsymbol{\Sigma}}_{\mu}=J^{\mu} \varepsilon_{\mu|\alpha \beta \gamma|} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \tag{43.9} \end{equation*}(43.9)J=JμdΣ~μ=Jμεμ|αβγ|dxαdxβdxγ
which is the integrand in eqn 43.5 . The exterior derivative of this quantity may be calculated straightforwardly, by following the usual rules
d J = ( J ) 123 x 0 d x 0 d x 1 d x 2 d x 3 + ( J ) 023 x 1 d x 1 d x 0 d x 2 d x 3 (43.10) + ( J ) 031 x 2 d x 2 d x 0 d x 3 d x 1 + J ) 012 x 3 d x 3 d x 0 d x 1 d x 2 d J = ( J ) 123 x 0 d x 0 d x 1 d x 2 d x 3 + ( J ) 023 x 1 d x 1 d x 0 d x 2 d x 3 (43.10) + ( J ) 031 x 2 d x 2 d x 0 d x 3 d x 1 + J ) 012 x 3 d x 3 d x 0 d x 1 d x 2 {:[d***J=(del(***J)_(123))/(delx^(0))dx^(0)^^dx^(1)^^dx^(2)^^dx^(3)+(del(***J)_(023))/(delx^(1))dx^(1)^^dx^(0)^^dx^(2)^^dx^(3)],[(43.10)+(del(***J)_(031))/(delx^(2))dx^(2)^^dx^(0)^^dx^(3)^^dx^(1)+(del***J)_(012))/(delx^(3))dx^(3)^^dx^(0)^^dx^(1)^^dx^(2)]:}\begin{align*} \boldsymbol{d} \star \boldsymbol{J}= & \frac{\partial(\star \boldsymbol{J})_{123}}{\partial x^{0}} \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3}+\frac{\partial(\star \boldsymbol{J})_{023}}{\partial x^{1}} \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{3} \\ & +\frac{\partial(\star \boldsymbol{J})_{031}}{\partial x^{2}} \boldsymbol{d} x^{2} \wedge \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{3} \wedge \boldsymbol{d} x^{1}+\frac{\partial \star \boldsymbol{J})_{012}}{\partial x^{3}} \boldsymbol{d} x^{3} \wedge \boldsymbol{d} x^{0} \wedge \boldsymbol{d} x^{1} \wedge \boldsymbol{d} x^{2} \tag{43.10} \end{align*}dJ=(J)123x0dx0dx1dx2dx3+(J)023x1dx1dx0dx2dx3(43.10)+(J)031x2dx2dx0dx3dx1+J)012x3dx3dx0dx1dx2
In the same way that J J ***J\star JJ represents J μ d Σ μ J μ d Σ μ J^(mu)dSigma_(mu)J^{\mu} \mathrm{d} \Sigma_{\mu}JμdΣμ in eqn 43.5 , this latter equation for d J d J d***J\boldsymbol{d} \star \boldsymbol{J}dJ expresses the integrand J μ , μ d V J μ , μ d V J^(mu)_(,mu)dVJ^{\mu}{ }_{, \mu} \mathrm{d} \mathcal{V}Jμ,μdV. This is because the 4 -volume element can be replaced with the 4 -form ε 0123 d t d x d y d z ε 0123 d t d x d y d z epsi_(0123)dt^^dx^^dy^^dz\varepsilon_{0123} \boldsymbol{d} t \wedge \boldsymbol{d} x \wedge \boldsymbol{d} y \wedge \boldsymbol{d} zε0123dtdxdydz and so we can write
d J = ( J ) ε | α β γ δ | d x α d x β d x γ d x δ (43.11) = 1 4 ! ( J ) ε α β γ δ d x α d x β d x γ d x δ d J = ( J ) ε | α β γ δ | d x α d x β d x γ d x δ (43.11) = 1 4 ! ( J ) ε α β γ δ d x α d x β d x γ d x δ {:[d***J=(grad*J)epsi_(|alpha beta gamma delta|)dx^(alpha)^^dx^(beta)^^dx^(gamma)^^dx^(delta)],[(43.11)=(1)/(4!)(grad*J)epsi_(alpha beta gamma delta)dx^(alpha)^^dx^(beta)^^dx^(gamma)^^dx^(delta)]:}\begin{align*} \boldsymbol{d} \star \boldsymbol{J} & =(\boldsymbol{\nabla} \cdot \boldsymbol{J}) \varepsilon_{|\alpha \beta \gamma \delta|} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \wedge \boldsymbol{d} x^{\delta} \\ & =\frac{1}{4!}(\boldsymbol{\nabla} \cdot \boldsymbol{J}) \varepsilon_{\alpha \beta \gamma \delta} \boldsymbol{d} x^{\alpha} \wedge \boldsymbol{d} x^{\beta} \wedge \boldsymbol{d} x^{\gamma} \wedge \boldsymbol{d} x^{\delta} \tag{43.11} \end{align*}dJ=(J)ε|αβγδ|dxαdxβdxγdxδ(43.11)=14!(J)εαβγδdxαdxβdxγdxδ
We conclude that conservation of charge in eqn 43.5 may be written as
(43.12) 0 = V d J = V J (43.12) 0 = V d J = V J {:(43.12)0=int_(V)d***J=int_(delV)***J:}\begin{equation*} 0=\int_{\mathcal{V}} d \star J=\int_{\partial \mathcal{V}} \star J \tag{43.12} \end{equation*}(43.12)0=VdJ=VJ
The conservation law is shown schematically in Fig. 43.1.
The point of this section is that the vanishing of these two integrals follows inevitably from (i) Maxwell's equations and (ii) the geometric fact that the boundary of a boundary is zero. Specifically, we can relate the dual of the source J J ***J\star \boldsymbol{J}J to the dual of the electromagnetic field F F F\boldsymbol{F}F using the Maxwell equation J = d F J = d F ***J=d***F\star \boldsymbol{J}=\boldsymbol{d} \star \boldsymbol{F}J=dF. All we need do now is write the conservation law in terms of an integral over d F d F d***F\boldsymbol{d} \star \boldsymbol{F}dF rather than J J ***J\star \boldsymbol{J}J and then apply Stokes theorem
(43.13) 0 = V J = V d F = V F . (43.13) 0 = V J = V d F = V F . {:(43.13)0=int_(delV)***J=int_(delV)d***F=int_(del delV)***F.:}\begin{equation*} 0=\int_{\partial \mathcal{V}} \star J=\int_{\partial \mathcal{V}} d \star \boldsymbol{F}=\int_{\partial \partial \mathcal{V}} \star \boldsymbol{F} . \tag{43.13} \end{equation*}(43.13)0=VJ=VdF=VF.
In this final equation we see that, since the boundary of the boundary V V del delV\partial \partial \mathcal{V}V is zero, this integral must itself yield zero (since it's being computed over chain that vanishes). Working backwards then, we can view the conservation of the source field, encoded in d J = 0 d J = 0 d***J=0\boldsymbol{d} \star \boldsymbol{J}=0dJ=0 as following from the fact that V = 0 . 6 V = 0 . 6 del delV=0.^(6)\partial \partial \mathcal{V}=0 .{ }^{6}V=0.6

43.2 Electromagnetic gauge field

The argument sketched in the last section can be applied to the electromagnetic gauge field A ~ A ~ tilde(A)\tilde{\boldsymbol{A}}A~ to extract a Bianchi identity for electromagnetism. We write the electromagnetic 2 -form field in terms of the gauge 1 -form field as F = d A ~ F = d A ~ F=d tilde(A)\boldsymbol{F}=\boldsymbol{d} \tilde{\boldsymbol{A}}F=dA~. Consider how the expression d F d F dF\boldsymbol{d} \boldsymbol{F}dF is modified through the application of Stokes' theorem
(43.14) V d F ~ = V F ~ . (43.14) V d F ~ = V F ~ . {:(43.14)int_(V)d tilde(F)=int_(delV) tilde(F).:}\begin{equation*} \int_{\mathcal{V}} d \tilde{\boldsymbol{F}}=\int_{\partial \mathcal{V}} \tilde{\boldsymbol{F}} . \tag{43.14} \end{equation*}(43.14)VdF~=VF~.
Now, inserting the gauge field, we see
(43.15) V F ~ = V d A ~ = V A ~ = 0 (43.15) V F ~ = V d A ~ = V A ~ = 0 {:(43.15)int_(delV) tilde(F)=int_(delV)d tilde(A)=int_(del delV) tilde(A)=0:}\begin{equation*} \int_{\partial \mathcal{V}} \tilde{\boldsymbol{F}}=\int_{\partial \mathcal{V}} \boldsymbol{d} \tilde{\boldsymbol{A}}=\int_{\partial \partial \mathcal{V}} \tilde{\boldsymbol{A}}=0 \tag{43.15} \end{equation*}(43.15)VF~=VdA~=VA~=0
In words, the facts that (i) the boundary of a boundary is zero, and (ii) that F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ is exact, mean that it is inevitable 7 7 ^(7){ }^{7}7 that d F ~ = 0 d F ~ = 0 d tilde(F)=0\boldsymbol{d} \tilde{\boldsymbol{F}}=0dF~=0. This is the first statement of the Bianchi identity for electromagnetism.
There are two ways to see the consequences of this, examined in the next examples.

Example 43.2

We examine the term V A ~ V A ~ int_(del delV) tilde(A)\int_{\partial \partial \mathcal{V}} \tilde{\boldsymbol{A}}VA~, integrating the 1-form A ~ = A μ d x μ A ~ = A μ d x μ tilde(A)=A_(mu)dx^(mu)\tilde{\boldsymbol{A}}=A_{\mu} \boldsymbol{d} x^{\mu}A~=Aμdxμ over the boundary of a cube in flat spacetime. That is to say, we examine
(43.16) ν A μ d x μ , (43.16) ν A μ d x μ , {:(43.16)int_(del del nu)A_(mu)dx^(mu)",":}\begin{equation*} \int_{\partial \partial \nu} A_{\mu} \boldsymbol{d} x^{\mu}, \tag{43.16} \end{equation*}(43.16)νAμdxμ,
Fig. 43.1 Conservation of charge tells us that the rate of change of the amount of charge in volume V V V\mathcal{V}V is equal to the flux J J ***J\star JJ summed over the surface V V delV\partial \mathcal{V}V.
6 6 ^(6){ }^{6}6 This explanation can be found in Cartan, although this was something Richard Feynman was not aware of, as shown by his remark in his Lectures of Gravitation that "I do not, offhand, know the geometric significance of the Bianchi identity." The version of the Bianchi identity." The version
we describe here is taken from Misner, we describe here is tak
Thorne, and Wheeler.
7 7 ^(7){ }^{7}7 Of course this was also mandated by d d = 0 ( d d = 0 ( dd=0(d \boldsymbol{d}=0(dd=0( via d d A ~ = d F ~ = 0 ) d d A ~ = d F ~ = 0 ) dd tilde(A)=d tilde(F)=0)d \boldsymbol{d} \tilde{\boldsymbol{A}}=\boldsymbol{d} \tilde{\boldsymbol{F}}=0)ddA~=dF~=0), and we can see here the sense in which this is dual to = 0 = 0 del del=0\partial \partial=0=0.
Fig. 43.2 A contribution to the integral over the boundary V V del delV\partial \partial \mathcal{V}V.
8 8 ^(8){ }^{8}8 This flat-spacetime equation can be written using comma notation as F [ α β , γ ] = 0 F [ α β , γ ] = 0 F_([alpha beta,gamma])=0F_{[\alpha \beta, \gamma]}=0F[αβ,γ]=0. The comma-goes-tosemicolon rule for curved spacetime then says F [ α β ; γ ] = 0 F [ α β ; γ ] = 0 F_([alpha beta;gamma])=0F_{[\alpha \beta ; \gamma]}=0F[αβ;γ]=0.
9 9 ^(9){ }^{9}9 The rule, from Chapter 32 is
F ~ , e x e y = F | i 1 i 2 | ( e x e y ) i 1 i 2 F ~ , e x e y = F i 1 i 2 e x e y i 1 i 2 (:( tilde(F)),e_(x)^^e_(y):)=F_(|i_(1)i_(2)|)(e_(x)^^e_(y))^(i_(1)i_(2))\left\langle\tilde{\boldsymbol{F}}, \boldsymbol{e}_{x} \wedge \boldsymbol{e}_{y}\right\rangle=F_{\left|i_{1} i_{2}\right|}\left(\boldsymbol{e}_{x} \wedge \boldsymbol{e}_{y}\right)^{i_{1} i_{2}}F~,exey=F|i1i2|(exey)i1i2
which, since the only non-vanishing component of the bivector is ( e x e x e_(x)^^\boldsymbol{e}_{x} \wedgeex e y ) x y = 1 e y x y = 1 e_(y))^(xy)=1\left.\boldsymbol{e}_{y}\right)^{x y}=1ey)xy=1, yields F x y F x y F_(xy)F_{x y}Fxy. In fact, it's trivial to see this by simply saying F ~ , e x F ~ , e x (:( tilde(F)),e_(x)^^:}\left\langle\tilde{\boldsymbol{F}}, \boldsymbol{e}_{x} \wedge\right.F~,ex e y = F ~ ( e x , e y ) e y = F ~ e x , e y {:e_(y):)= tilde(F)(e_(x),e_(y))\left.\boldsymbol{e}_{y}\right\rangle=\tilde{\boldsymbol{F}}\left(\boldsymbol{e}_{x}, \boldsymbol{e}_{y}\right)ey=F~(ex,ey).
as we traverse each edge of the cube. Consider the path shown in Fig. 43.2. Labelling the corners of the square anticlockwise from x , y x , y x,yx, yx,y we find that in traversing a square we obtain a contribution
A x ( A ) d x + A y ( B ) d y A x ( C ) d x A y ( D ) d y A x ( A ) d x + A y ( B ) d y A x ( C ) d x A y ( D ) d y A_(x)(A)dx+A_(y)(B)dy-A_(x)(C)dx-A_(y)(D)dyA_{x}(\mathcal{A}) \mathrm{d} x+A_{y}(\mathcal{B}) \mathrm{d} y-A_{x}(\mathcal{C}) \mathrm{d} x-A_{y}(\mathcal{D}) \mathrm{d} yAx(A)dx+Ay(B)dyAx(C)dxAy(D)dy
= [ A x ( A ) A x ( C ) ] d x + [ A y ( B ) A y ( D ) ] d y = A x ( A ) A x ( C ) d x + A y ( B ) A y ( D ) d y =[A_(x)(A)-A_(x)(C)]dx+[A_(y)(B)-A_(y)(D)]dy=\left[A_{x}(\mathcal{A})-A_{x}(\mathcal{C})\right] \mathrm{d} x+\left[A_{y}(\mathcal{B})-A_{y}(\mathcal{D})\right] \mathrm{d} y=[Ax(A)Ax(C)]dx+[Ay(B)Ay(D)]dy
= [ A x ( A ) A x ( A ) A x y d y ] d x + [ A y ( B ) A y ( B ) + A y x d x ] d y = A x ( A ) A x ( A ) A x y d y d x + A y ( B ) A y ( B ) + A y x d x d y =[A_(x)(A)-A_(x)(A)-(delA_(x))/(del y)(d)y]dx+[A_(y)(B)-A_(y)(B)+(delA_(y))/(del x)(d)x]dy=\left[A_{x}(\mathcal{A})-A_{x}(\mathcal{A})-\frac{\partial A_{x}}{\partial y} \mathrm{~d} y\right] \mathrm{d} x+\left[A_{y}(\mathcal{B})-A_{y}(\mathcal{B})+\frac{\partial A_{y}}{\partial x} \mathrm{~d} x\right] \mathrm{d} y=[Ax(A)Ax(A)Axy dy]dx+[Ay(B)Ay(B)+Ayx dx]dy
(43.17) = ( A y x A x y ) d x d y (43.17) = A y x A x y d x d y {:(43.17)=((delA_(y))/(del x)-(delA_(x))/(del y))dxdy:}\begin{equation*} =\left(\frac{\partial A_{y}}{\partial x}-\frac{\partial A_{x}}{\partial y}\right) \mathrm{d} x \mathrm{~d} y \tag{43.17} \end{equation*}(43.17)=(AyxAxy)dx dy
The contribution from the face shown is therefore given by
(43.18) ( A y x A x y ) d x d y = F x y d x d y (43.18) A y x A x y d x d y = F x y d x d y {:(43.18)int((delA_(y))/(del x)-(delA_(x))/(del y))dxdy=intF_(xy)dxdy:}\begin{equation*} \int\left(\frac{\partial A_{y}}{\partial x}-\frac{\partial A_{x}}{\partial y}\right) \mathrm{d} x \mathrm{~d} y=\int F_{x y} \mathrm{~d} x \mathrm{~d} y \tag{43.18} \end{equation*}(43.18)(AyxAxy)dx dy=Fxy dx dy
Repeating this procedure and adding to this the contribution from the opposite face, which is traversed in the other direction, we obtain
(43.19) [ F x y ( z + d z ) F x y ( z ) ] d x d y = F x y z d x d y d z (43.19) F x y ( z + d z ) F x y ( z ) d x d y = F x y z d x d y d z {:(43.19)int[F_(xy)(z+dz)-F_(xy)(z)]dxdy=int(delF_(xy))/(del z)dxdydz:}\begin{equation*} \int\left[F_{x y}(z+\mathrm{d} z)-F_{x y}(z)\right] \mathrm{d} x \mathrm{~d} y=\int \frac{\partial F_{x y}}{\partial z} \mathrm{~d} x \mathrm{~d} y \mathrm{~d} z \tag{43.19} \end{equation*}(43.19)[Fxy(z+dz)Fxy(z)]dx dy=Fxyz dx dy dz
Adding all of the contributions, we obtain
(43.20) V A μ d x μ = ( F x y z + F z x y + F y z x ) d x d y d z (43.20) V A μ d x μ = F x y z + F z x y + F y z x d x d y d z {:(43.20)int_(del delV)A_(mu)dx^(mu)=int((delF_(xy))/(del z)+(delF_(zx))/(del y)+(delF_(yz))/(del x))dxdydz:}\begin{equation*} \int_{\partial \partial \mathcal{V}} A_{\mu} \boldsymbol{d} x^{\mu}=\int\left(\frac{\partial F_{x y}}{\partial z}+\frac{\partial F_{z x}}{\partial y}+\frac{\partial F_{y z}}{\partial x}\right) \mathrm{d} x \mathrm{~d} y \mathrm{~d} z \tag{43.20} \end{equation*}(43.20)VAμdxμ=(Fxyz+Fzxy+Fyzx)dx dy dz
The integrand on the right must be zero, since we known that V A ~ = 0 V A ~ = 0 int_(del delV) tilde(A)=0\int_{\partial \partial \mathcal{V}} \tilde{\boldsymbol{A}}=0VA~=0. The cube can then be reoriented in the ( 3 + 1 ) ( 3 + 1 ) (3+1)(3+1)(3+1) dimensions in which we're working, and the argument repeated for different coordinates.
We conclude that in flat spacetime the components of the Faraday tensor obey the constraining equation 8 8 ^(8){ }^{8}8
(43.21) F α β , γ + F γ α , β + F β γ , α = 0 (43.21) F α β , γ + F γ α , β + F β γ , α = 0 {:(43.21)F_(alpha beta,gamma)+F_(gamma alpha,beta)+F_(beta gamma,alpha)=0:}\begin{equation*} F_{\alpha \beta, \gamma}+F_{\gamma \alpha, \beta}+F_{\beta \gamma, \alpha}=0 \tag{43.21} \end{equation*}(43.21)Fαβ,γ+Fγα,β+Fβγ,α=0
This equation, a consequence of the fact that a boundary of a boundary is zero, is the Bianchi identity in component form. They gives us the two Maxwell's equations which, as we saw in the last chapter, are contained in d F ~ = 0 d F ~ = 0 d tilde(F)=0\boldsymbol{d} \tilde{\boldsymbol{F}}=0dF~=0.

Example 43.3

Another way to approach this same problem, which proves useful when we consider gravity, is to consider I = ν F ~ = 0 I = ν F ~ = 0 I=int_(del nu) tilde(F)=0I=\int_{\partial \nu} \tilde{\boldsymbol{F}}=0I=νF~=0, that is, the surface integral of the 2 -form F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~. We can interpret this integral over a 2 -form in the spirit of Chapter 38 . We represent the surface by the bivector u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv and, to evaluate the integral, take the inner product of this bivector with the 2 -form F F ¯ bar(F)\overline{\boldsymbol{F}}F. Geometrically, this returns a number that tells us how many tubes of F F F\boldsymbol{F}F are cut by the parallelogram u v u v u^^v\boldsymbol{u} \wedge \boldsymbol{v}uv. We have a contribution I x y I x y I_(xy)I_{x y}Ixy from the face shown in Fig. 43.2 given by
(43.22) I x y = Δ x e x Δ y e y F ~ = F ~ , e x e y Δ x Δ y (43.22) I x y = Δ x e x Δ y e y F ~ = F ~ , e x e y Δ x Δ y {:(43.22)I_(xy)=int_(Delta xe_(x)^^Delta ye_(y)) tilde(F)=int(:( tilde(F)),e_(x)^^e_(y):)Delta x Delta y:}\begin{equation*} I_{x y}=\int_{\Delta x \boldsymbol{e}_{x} \wedge \Delta y \boldsymbol{e}_{y}} \tilde{\boldsymbol{F}}=\int\left\langle\tilde{\boldsymbol{F}}, \boldsymbol{e}_{x} \wedge \boldsymbol{e}_{y}\right\rangle \Delta x \Delta y \tag{43.22} \end{equation*}(43.22)Ixy=ΔxexΔyeyF~=F~,exeyΔxΔy
Using the expression for the inner product of a bivector and a 2 -form 9 9 ^(9){ }^{9}9 we have
(43.23) I x y = F x y Δ x Δ y (43.23) I x y = F x y Δ x Δ y {:(43.23)I_(xy)=intF_(xy)Delta x Delta y:}\begin{equation*} I_{x y}=\int F_{x y} \Delta x \Delta y \tag{43.23} \end{equation*}(43.23)Ixy=FxyΔxΔy
Repeating the argument for the opposite face, as in the last example, and adding this (exercise) leads to the contribution
(43.24) F x y z Δ x Δ y Δ z (43.24) F x y z Δ x Δ y Δ z {:(43.24)int(delF_(xy))/(del z)Delta x Delta y Delta z:}\begin{equation*} \int \frac{\partial F_{x y}}{\partial z} \Delta x \Delta y \Delta z \tag{43.24} \end{equation*}(43.24)FxyzΔxΔyΔz
Repeating and adding the contributions from the other faces gives us the Bianchi identity in component form once again.

43.3 Gravitational curvature

We saw in Chapter 13 that the Bianchi identity for gravitation was an essential part of justifying the Einstein equation. Just like its electromagnetic analogue, the gravitational Bianchi identity follows from considering the consequence of the boundary of a boundary being zero. In the electromagnetic case, we evaluated the effect of this on the 2 -form field F F F\boldsymbol{F}F. For gravitation the analogous quantity is the (1,3) Riemann tensor R R R\boldsymbol{R}R. Instead of working with R R R\boldsymbol{R}R directly, it is easiest to use Cartan's method (Chapter 36) to find the curvature 2-form, which allows us to act on vectors using the operator equation
(43.25) d 2 v = R ( v ) , (43.25) d 2 v = R ( v ) , {:(43.25)d^(2)v=R(v)",":}\begin{equation*} \boldsymbol{d}^{2} \boldsymbol{v}=\mathcal{R}(\boldsymbol{v}), \tag{43.25} \end{equation*}(43.25)d2v=R(v),
where the curvature operator is R ( ) = ω ν ( ) e μ R μ ν R ( ) = ω ν ( ) e μ R μ ν R()=omega^(nu)()oxe_(mu)oxR^(mu)_(nu)\mathcal{R}()=\boldsymbol{\omega}^{\nu}() \otimes \boldsymbol{e}_{\mu} \otimes \mathcal{R}^{\mu}{ }_{\nu}R()=ων()eμRμν and R μ ν R μ ν R^(mu)_(nu)\mathcal{R}^{\mu}{ }_{\nu}Rμν is the all-important curvature 2 -form given in terms of the connection 1 -forms 10 10 ^(10){ }^{10}10 via
(43.27) R ν μ = d ω μ ν + ω μ α ω α ν . (43.27) R ν μ = d ω μ ν + ω μ α ω α ν . {:(43.27)R_(nu)^(mu)=domega^(mu)_(nu)+omega^(mu)_(alpha)^^omega^(alpha)_(nu).:}\begin{equation*} \mathcal{R}_{\nu}^{\mu}=\boldsymbol{d} \omega^{\mu}{ }_{\nu}+\boldsymbol{\omega}^{\mu}{ }_{\alpha} \wedge \boldsymbol{\omega}^{\alpha}{ }_{\nu} . \tag{43.27} \end{equation*}(43.27)Rνμ=dωμν+ωμαωαν.
Why focus on R R R\mathcal{R}R ? The answer is that, just as the electromagnetic Bianchi identity can be written as d F = 0 d F = 0 dF=0\boldsymbol{d} \boldsymbol{F}=0dF=0, the gravitational version is
The Bianchi identity for gravitation:
(43.28) d R = 0 . (43.28) d R = 0 . {:(43.28)dR=0.:}\begin{equation*} d \mathcal{R}=0 . \tag{43.28} \end{equation*}(43.28)dR=0.
We prove this important result in three steps, each involving a computation.
Step I: The result of taking an exterior derivative of the curvature 2form is
(43.29) d R μ ν = R μ α ω α ν ω μ α R α ν . (43.29) d R μ ν = R μ α ω α ν ω μ α R α ν . {:(43.29)dR^(mu)_(nu)=R^(mu)_(alpha)^^omega^(alpha)_(nu)-omega^(mu)_(alpha)^^R^(alpha)_(nu).:}\begin{equation*} \boldsymbol{d} \mathcal{R}^{\mu}{ }_{\nu}=\mathcal{R}^{\mu}{ }_{\alpha} \wedge \boldsymbol{\omega}^{\alpha}{ }_{\nu}-\boldsymbol{\omega}^{\mu}{ }_{\alpha} \wedge \mathcal{R}^{\alpha}{ }_{\nu} . \tag{43.29} \end{equation*}(43.29)dRμν=RμαωανωμαRαν.
Example 43.4
Equation 43.29 can be proved as follows. We work in the orthonormal frame, 11 11 ^(11){ }^{11}11 so that we have ω μ ν = ω ν μ ω μ ν = ω ν μ omega_(mu nu)=-omega_(nu mu)\boldsymbol{\omega}_{\mu \nu}=-\boldsymbol{\omega}_{\nu \mu}ωμν=ωνμ. Take the derivative of R ν μ R ν μ R_(nu)^(mu)\mathcal{R}_{\nu}^{\mu}Rνμ to find
d R μ ν = d ( ω μ α ω α ν ) = d ω α μ ω α ν ω μ α d ω α ν (43.30) = ( R α μ ω μ β ω β α ) ω α ν ω μ α ( R α ν ω α β ω β ν ) . d R μ ν = d ω μ α ω α ν = d ω α μ ω α ν ω μ α d ω α ν (43.30) = R α μ ω μ β ω β α ω α ν ω μ α R α ν ω α β ω β ν . {:[dR^(mu)_(nu)=d(omega^(mu)_(alpha)^^omega^(alpha)_(nu))],[=domega_(alpha)^(mu)^^omega^(alpha)_(nu)-omega^(mu)_(alpha)^^domega^(alpha)_(nu)],[(43.30)=(R_(alpha)^(mu)-omega^(mu)_(beta)^^omega^(beta)_(alpha))^^omega^(alpha)_(nu)-omega^(mu)_(alpha)^^(R^(alpha)_(nu)-omega^(alpha)_(beta)^^omega^(beta)_(nu)).]:}\begin{align*} \boldsymbol{d}{\mathcal{R}^{\mu}}_{\nu} & =\boldsymbol{d}\left(\boldsymbol{\omega}^{\mu}{ }_{\alpha} \wedge \boldsymbol{\omega}^{\alpha}{ }_{\nu}\right) \\ & =\boldsymbol{d} \boldsymbol{\omega}_{\alpha}^{\mu} \wedge \boldsymbol{\omega}^{\alpha}{ }_{\nu}-\boldsymbol{\omega}^{\mu}{ }_{\alpha} \wedge \boldsymbol{d} \boldsymbol{\omega}^{\alpha}{ }_{\nu} \\ & =\left(\mathcal{R}_{\alpha}^{\mu}-\boldsymbol{\omega}^{\mu}{ }_{\beta} \wedge \boldsymbol{\omega}^{\beta}{ }_{\alpha}\right) \wedge \boldsymbol{\omega}^{\alpha}{ }_{\nu}-\boldsymbol{\omega}^{\mu}{ }_{\alpha} \wedge\left(\mathcal{R}^{\alpha}{ }_{\nu}-\boldsymbol{\omega}^{\alpha}{ }_{\beta} \wedge \boldsymbol{\omega}^{\beta}{ }_{\nu}\right) . \tag{43.30} \end{align*}dRμν=d(ωμαωαν)=dωαμωανωμαdωαν(43.30)=(Rαμωμβωβα)ωανωμα(Rανωαβωβν).
We find that the triple wedge products cancel and we have that
(43.31) d R ν μ R α μ α ω α ν + ω α μ α R ν α = 0 (43.31) d R ν μ R α μ α ω α ν + ω α μ α R ν α = 0 {:(43.31)dR_(nu)^(mu)-R_(alpha)^(mu)_(alpha)^^omega^(alpha)_(nu)+omega_(alpha)^(mu)_(alpha)^^R_(nu)^(alpha)=0:}\begin{equation*} \boldsymbol{d} \mathcal{R}_{\nu}^{\mu}-\mathcal{R}_{\alpha}^{\mu}{ }_{\alpha} \wedge \boldsymbol{\omega}^{\alpha}{ }_{\nu}+\boldsymbol{\omega}_{\alpha}^{\mu}{ }_{\alpha} \wedge \mathcal{R}_{\nu}^{\alpha}=0 \tag{43.31} \end{equation*}(43.31)dRνμRαμαωαν+ωαμαRνα=0
Step II: We need the result of taking the exterior derivative of a bivector formed from the basis vectors. 12 12 ^(12){ }^{12}12
10 10 ^(10){ }^{10}10 We also have the equation that determines the connection 1-forms
d ω μ + ω ν μ ω ν = 0 d ω μ + ω ν μ ω ν = 0 domega^(mu)+omega_(nu)^(mu)^^omega^(nu)=0\boldsymbol{d} \boldsymbol{\omega}^{\mu}+\boldsymbol{\omega}_{\nu}^{\mu} \wedge \boldsymbol{\omega}^{\nu}=0dωμ+ωνμων=0
11 11 ^(11){ }^{11}11 We won't put hats on the indices in this chapter, to prevent clutter.
12 12 ^(12){ }^{12}12 We need the rule that if p ~ p ~ tilde(p)\tilde{\boldsymbol{p}}p~ is a p p ppp-form and q ~ q ~ tilde(q)\tilde{\boldsymbol{q}}q~ is a 1 -form then
p ~ q ~ = ( 1 ) p q q ~ p ~ p ~ q ~ = ( 1 ) p q q ~ p ~ tilde(p)^^ tilde(q)=(-1)^(pq) tilde(q)^^ tilde(p)\tilde{\boldsymbol{p}} \wedge \tilde{\boldsymbol{q}}=(-1)^{p q} \tilde{\boldsymbol{q}} \wedge \tilde{\boldsymbol{p}}p~q~=(1)pqq~p~
We also have the rule that tensor products of 1 -forms and 1 -vectors commute, allowing us to write e μ ω ν = ω ν e μ e μ ω ν = ω ν e μ e_(mu)oxomega^(nu)=omega^(nu)oxe_(mu)\boldsymbol{e}_{\mu} \otimes \boldsymbol{\omega}^{\nu}=\boldsymbol{\omega}^{\nu} \otimes \boldsymbol{e}_{\mu}eμων=ωνeμ.
Example 43.5
Compute as follows:
d ( e μ e ν ) = d ( e μ e ν e ν e μ ) = d e μ e ν + e μ d e ν d e ν e μ e ν d e μ = ω α μ e α e ν + e μ ω β ν e β ω γ μ e γ e μ e ν ω δ ν e δ = ω α μ e α e ν + e μ ω β ν e β ω β μ e β e μ e ν ω α ν e α = ω α μ ( e α e ν ) + ω β ν ( e μ e β ) . d e μ e ν = d e μ e ν e ν e μ = d e μ e ν + e μ d e ν d e ν e μ e ν d e μ = ω α μ e α e ν + e μ ω β ν e β ω γ μ e γ e μ e ν ω δ ν e δ = ω α μ e α e ν + e μ ω β ν e β ω β μ e β e μ e ν ω α ν e α = ω α μ e α e ν + ω β ν e μ e β . {:[d(e_(mu)^^e_(nu))=d(e_(mu)oxe_(nu)-e_(nu)oxe_(mu))],[=de_(mu)oxe_(nu)+e_(mu)ox de_(nu)-de_(nu)oxe_(mu)-e_(nu)ox de_(mu)],[=omega^(alpha)_(mu)oxe_(alpha)oxe_(nu)+e_(mu)oxomega^(beta)_(nu)oxe_(beta)-omega^(gamma)_(mu)oxe_(gamma)oxe_(mu)-e_(nu)oxomega^(delta)_(nu)oxe_(delta)],[=omega^(alpha)_(mu)oxe_(alpha)oxe_(nu)+e_(mu)oxomega^(beta)_(nu)oxe_(beta)-omega^(beta)_(mu)oxe_(beta)oxe_(mu)-e_(nu)oxomega^(alpha)_(nu)oxe_(alpha)],[=omega^(alpha)_(mu)(e_(alpha)^^e_(nu))+omega^(beta)_(nu)(e_(mu)^^e_(beta)).]:}\begin{aligned} \boldsymbol{d}\left(\boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\nu}\right) & =\boldsymbol{d}\left(\boldsymbol{e}_{\mu} \otimes \boldsymbol{e}_{\nu}-\boldsymbol{e}_{\nu} \otimes \boldsymbol{e}_{\mu}\right) \\ & =\boldsymbol{d} e_{\mu} \otimes \boldsymbol{e}_{\nu}+\boldsymbol{e}_{\mu} \otimes \boldsymbol{d} \boldsymbol{e}_{\nu}-\boldsymbol{d} \boldsymbol{e}_{\nu} \otimes \boldsymbol{e}_{\mu}-\boldsymbol{e}_{\nu} \otimes \boldsymbol{d} \boldsymbol{e}_{\mu} \\ & =\boldsymbol{\omega}^{\alpha}{ }_{\mu} \otimes \boldsymbol{e}_{\alpha} \otimes \boldsymbol{e}_{\nu}+\boldsymbol{e}_{\mu} \otimes \boldsymbol{\omega}^{\beta}{ }_{\nu} \otimes \boldsymbol{e}_{\beta}-\boldsymbol{\omega}^{\gamma}{ }_{\mu} \otimes \boldsymbol{e}_{\gamma} \otimes \boldsymbol{e}_{\mu}-\boldsymbol{e}_{\nu} \otimes \boldsymbol{\omega}^{\delta}{ }_{\nu} \otimes \boldsymbol{e}_{\delta} \\ & =\boldsymbol{\omega}^{\alpha}{ }_{\mu} \otimes \boldsymbol{e}_{\alpha} \otimes \boldsymbol{e}_{\nu}+\boldsymbol{e}_{\mu} \otimes \boldsymbol{\omega}^{\beta}{ }_{\nu} \otimes \boldsymbol{e}_{\beta}-\boldsymbol{\omega}^{\beta}{ }_{\mu} \otimes \boldsymbol{e}_{\beta} \otimes \boldsymbol{e}_{\mu}-\boldsymbol{e}_{\nu} \otimes \boldsymbol{\omega}^{\alpha}{ }_{\nu} \otimes \boldsymbol{e}_{\alpha} \\ & =\boldsymbol{\omega}^{\alpha}{ }_{\mu}\left(\boldsymbol{e}_{\alpha} \wedge \boldsymbol{e}_{\nu}\right)+\boldsymbol{\omega}^{\beta}{ }_{\nu}\left(\boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\beta}\right) . \end{aligned}d(eμeν)=d(eμeνeνeμ)=deμeν+eμdeνdeνeμeνdeμ=ωαμeαeν+eμωβνeβωγμeγeμeνωδνeδ=ωαμeαeν+eμωβνeβωβμeβeμeνωανeα=ωαμ(eαeν)+ωβν(eμeβ).
13 13 ^(13){ }^{13}13 Remember that, in an orthonormal frame, ω μ ν = ω ν μ ω μ ν = ω ν μ omega^(mu nu)=-omega^(nu mu)\boldsymbol{\omega}^{\mu \nu}=-\boldsymbol{\omega}^{\nu \mu}ωμν=ωνμ.
ELECTROMAGNETISM

GRAVITY
V d R = 0 V d R = 0 int_(V)dR=0\int_{\mathcal{V}} \boldsymbol{d} \mathcal{R}=0VdR=0
V R ( w ) = 0 = V d 2 w V R ( w ) = 0 = V d 2 w int_(delV)R(w)=0=int_(delV)d^(2)w\int_{\partial \mathcal{V}} \mathcal{R}(\boldsymbol{w})=0=\int_{\partial \mathcal{V}} \boldsymbol{d}^{2} \boldsymbol{w}VR(w)=0=Vd2w
Fig. 43.3 Boundary of a boundary arguments for electromagnetism (top) and gravity (bottom).
Step III: Working in the orthonormal frame, we define R μ ν = R μ α η α ν R μ ν = R μ α η α ν R^(mu nu)=R^(mu)_(alpha)eta^(alpha nu)\mathcal{R}^{\mu \nu}=\mathcal{R}^{\mu}{ }_{\alpha} \eta^{\alpha \nu}Rμν=Rμαηαν. We can then consider an alternative definition of the curvature operator
(43.34) R = 1 2 ( e μ e ν ) R μ ν (43.34) R = 1 2 e μ e ν R μ ν {:(43.34)R=(1)/(2)(e_(mu)^^e_(nu))R^(mu nu):}\begin{equation*} \mathcal{R}=\frac{1}{2}\left(e_{\mu} \wedge e_{\nu}\right) \mathcal{R}^{\mu \nu} \tag{43.34} \end{equation*}(43.34)R=12(eμeν)Rμν
where 13 13 ^(13){ }^{13}13
Using this, we can combine the results of steps I and II to prove our result.
Example 43.6
Take the derivative of R R R\mathcal{R}R to find
d R = 1 2 d ( e μ e ν ) R μ ν + 1 2 ( e μ e ν ) d R μ ν = 1 2 ( e α e ν ) ( ω α μ R μ ν ) + 1 2 ( e μ e β ) ( ω ν β R μ ν ) d R = 1 2 d e μ e ν R μ ν + 1 2 e μ e ν d R μ ν = 1 2 e α e ν ω α μ R μ ν + 1 2 e μ e β ω ν β R μ ν {:[dR=(1)/(2)d(e_(mu)^^e_(nu))^^R^(mu nu)+(1)/(2)(e_(mu)^^e_(nu))dR^(mu nu)],[=(1)/(2)(e_(alpha)^^e_(nu))(omega^(alpha)_(mu)^^R^(mu nu))+(1)/(2)(e_(mu)^^e_(beta))(omega_(nu)^(beta)^^R^(mu nu))]:}\begin{aligned} \boldsymbol{d} \mathcal{R} & =\frac{1}{2} \boldsymbol{d}\left(\boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\nu}\right) \wedge \mathcal{R}^{\mu \nu}+\frac{1}{2}\left(\boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\nu}\right) \boldsymbol{d} \mathcal{R}^{\mu \nu} \\ & =\frac{1}{2}\left(\boldsymbol{e}_{\alpha} \wedge \boldsymbol{e}_{\nu}\right)\left(\boldsymbol{\omega}^{\alpha}{ }_{\mu} \wedge \mathcal{R}^{\mu \nu}\right)+\frac{1}{2}\left(\boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\beta}\right)\left(\boldsymbol{\omega}_{\nu}^{\beta} \wedge \mathcal{R}^{\mu \nu}\right) \end{aligned}dR=12d(eμeν)Rμν+12(eμeν)dRμν=12(eαeν)(ωαμRμν)+12(eμeβ)(ωνβRμν)
+ 1 2 ( e μ e ν ) d R μ ν + 1 2 e μ e ν d R μ ν +(1)/(2)(e_(mu)^^e_(nu))dR^(mu nu)+\frac{1}{2}\left(e_{\mu} \wedge e_{\nu}\right) d \mathcal{R}^{\mu \nu}+12(eμeν)dRμν
= 1 2 ( e μ e ν ) ( ω α μ R α ν + ω β ν R μ β = 1 2 e μ e ν ω α μ R α ν + ω β ν R μ β =(1)/(2)(e_(mu)^^e_(nu))(omega_(alpha)^(mu)^^R^(alpha nu)+omega_(beta)^(nu)^^R^(mu beta):}=\frac{1}{2}\left(\boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\nu}\right)\left(\boldsymbol{\omega}_{\alpha}^{\mu} \wedge \mathcal{R}^{\alpha \nu}+\boldsymbol{\omega}_{\beta}^{\nu} \wedge \mathcal{R}^{\mu \beta}\right.=12(eμeν)(ωαμRαν+ωβνRμβ
R β μ ω ν β + ω α μ R ν α ) R β μ ω ν β + ω α μ R ν α {:-R_(beta)^(mu)^^omega^(nu beta)+omega_(alpha)^(mu)^^R^(nu alpha))\left.-\mathcal{R}_{\beta}^{\mu} \wedge \boldsymbol{\omega}^{\nu \beta}+\omega_{\alpha}^{\mu} \wedge \mathcal{R}^{\nu \alpha}\right)Rβμωνβ+ωαμRνα)
= 1 2 ( e μ e ν ) ( ω β ν R μ β R β μ ω ν β ) = 1 2 e μ e ν ω β ν R μ β R β μ ω ν β =(1)/(2)(e_(mu)^^e_(nu))(omega_(beta)^(nu)^^R^(mu beta)-R_(beta)^(mu)^^omega^(nu beta))=\frac{1}{2}\left(e_{\mu} \wedge e_{\nu}\right)\left(\omega_{\beta}^{\nu} \wedge \mathcal{R}^{\mu \beta}-\mathcal{R}_{\beta}^{\mu} \wedge \omega^{\nu \beta}\right)=12(eμeν)(ωβνRμβRβμωνβ)
(43.36) = 1 2 ( e μ e ν ) ( ω β ν R μ β ω ν β R β μ ) = 0 . (43.36) = 1 2 e μ e ν ω β ν R μ β ω ν β R β μ = 0 . {:(43.36)=(1)/(2)(e_(mu)^^e_(nu))(omega_(beta)^(nu)^^R^(mu beta)-omega^(nu beta)^^R_(beta)^(mu))=0.:}\begin{equation*} =\frac{1}{2}\left(\boldsymbol{e}_{\mu} \wedge \boldsymbol{e}_{\nu}\right)\left(\boldsymbol{\omega}_{\beta}^{\nu} \wedge \mathcal{R}^{\mu \beta}-\boldsymbol{\omega}^{\nu \beta} \wedge \mathcal{R}_{\beta}^{\mu}\right)=0 . \tag{43.36} \end{equation*}(43.36)=12(eμeν)(ωβνRμβωνβRβμ)=0.
In the last line, we've used the fact that ω μ α R ν α = ω μ α R ν α ω μ α R ν α = ω μ α R ν α omega^(mu)_(alpha)^^R^(nu alpha)=omega^(mu alpha)^^R^(nu)_(alpha)\omega^{\mu}{ }_{\alpha} \wedge \mathcal{R}^{\nu \alpha}=\omega^{\mu \alpha} \wedge \mathcal{R}^{\nu}{ }_{\alpha}ωμαRνα=ωμαRνα, and that R μ ν R μ ν R^(mu)_(nu)\mathcal{R}^{\mu}{ }_{\nu}Rμν is a 2-form and ω μ ν ω μ ν omega^(mu)_(nu)\omega^{\mu}{ }_{\nu}ωμν is a 1 -form, so inverting the wedge product does not pick up a sign.
Although the computation was rather lengthy, we have established the Bianchi identity d R = 0 d R = 0 dR=0\boldsymbol{d} \mathcal{R}=0dR=0. This allows us to repeat the general argument above (and in Fig. 43.3) involving the boundary of a boundary. Since d R = 0 d R = 0 dR=0\boldsymbol{d} \mathcal{R}=0dR=0, we have that
(43.37) 0 = V d R = V R . (43.37) 0 = V d R = V R . {:(43.37)0=int_(V)dR=int_(del V)R.:}\begin{equation*} 0=\int_{\mathcal{V}} d \mathcal{R}=\int_{\partial V} \mathcal{R} . \tag{43.37} \end{equation*}(43.37)0=VdR=VR.
Here, R R R\mathcal{R}R is defined by the double derivative of a vector field w w w\boldsymbol{w}w by writing d 2 w = R ( w ) d 2 w = R ( w ) d^(2)w=R(w)\boldsymbol{d}^{2} \boldsymbol{w}=\mathcal{R}(\boldsymbol{w})d2w=R(w). So we're now suggesting that 0 = ν d 2 w = ν d w 0 = ν d 2 w = ν d w 0=int_(del nu)d^(2)w=int_(del del nu)dw0=\int_{\partial \nu} \boldsymbol{d}^{2} \boldsymbol{w}=\int_{\partial \partial \nu} \boldsymbol{d} \boldsymbol{w}0=νd2w=νdw. So we might expect to be able to derive a component version of the Bianchi identity by evaluating the vector-valued 1-form d w = w d w = w dw=grad w\boldsymbol{d} \boldsymbol{w}=\boldsymbol{\nabla} \boldsymbol{w}dw=w round a boundary, and setting it equal to zero. Equivalently, we can evaluate the surface integral of d 2 w d 2 w d^(2)w\boldsymbol{d}^{2} \boldsymbol{w}d2w and obtain the same answer. We shall do the latter. 14 14 ^(14){ }^{14}14
To evaluate our integral, we need the result of the inner product of the form d 2 w d 2 w d^(2)w\boldsymbol{d}^{2} \boldsymbol{w}d2w and a surface element, as examined in the next example.
Example 43.7
Let's consider d α ~ , u v d α ~ , u v (:d tilde(alpha),u^^v:)\langle\boldsymbol{d} \tilde{\boldsymbol{\alpha}}, \boldsymbol{u} \wedge \boldsymbol{v}\rangledα~,uv, where α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~ is a 1-form and u u u\boldsymbol{u}u and v v v\boldsymbol{v}v are vectors. This object counts the number of cells of the 2 -form d α ~ d α ~ d tilde(alpha)d \tilde{\alpha}dα~ in the parallelogram formed by the bivector u v . 15 u v . 15 u^^v.^(15)\boldsymbol{u} \wedge \boldsymbol{v} .{ }^{15}uv.15 It can be rewritten as 16 16 ^(16){ }^{16}16
d α ~ , u v = d α ~ , u v d α ~ , v u (43.38) = u α ~ , v v α ~ , u d α ~ , u v = d α ~ , u v d α ~ , v u (43.38) = u α ~ , v v α ~ , u {:[(:d tilde(alpha)","u^^v:)=(:d tilde(alpha)","u ox v:)-(:d tilde(alpha)","v ox u:)],[(43.38)=(:grad_(u)( tilde(alpha)),v:)-(:grad_(v)( tilde(alpha)),u:)]:}\begin{align*} \langle\boldsymbol{d} \tilde{\boldsymbol{\alpha}}, \boldsymbol{u} \wedge \boldsymbol{v}\rangle & =\langle\boldsymbol{d} \tilde{\boldsymbol{\alpha}}, \boldsymbol{u} \otimes \boldsymbol{v}\rangle-\langle\boldsymbol{d} \tilde{\boldsymbol{\alpha}}, \boldsymbol{v} \otimes \boldsymbol{u}\rangle \\ & =\left\langle\nabla_{u} \tilde{\alpha}, \boldsymbol{v}\right\rangle-\left\langle\nabla_{v} \tilde{\alpha}, \boldsymbol{u}\right\rangle \tag{43.38} \end{align*}dα~,uv=dα~,uvdα~,vu(43.38)=uα~,vvα~,u
Expanding the actions of the covariant derivative on forms, we have 17 17 ^(17){ }^{17}17
u α ~ , v v α ~ , u = u α ~ , v α ~ , u v v α ~ , u + α ~ , v u (43.39) = u α ~ , v v α ~ , u α ~ , [ u , v ] u α ~ , v v α ~ , u = u α ~ , v α ~ , u v v α ~ , u + α ~ , v u (43.39) = u α ~ , v v α ~ , u α ~ , [ u , v ] {:[(:grad_(u)( tilde(alpha)),v:)-(:grad_(v)( tilde(alpha)),u:)=grad_(u)(: tilde(alpha)","v:)-(:( tilde(alpha)),grad_(u)v:)-grad_(v)(: tilde(alpha)","u:)+(:( tilde(alpha)),grad_(v)u:)],[(43.39)=grad_(u)(: tilde(alpha)","v:)-grad_(v)(: tilde(alpha)","u:)-(: tilde(alpha)","[u","v]:)]:}\begin{align*} \left\langle\nabla_{u} \tilde{\boldsymbol{\alpha}}, \boldsymbol{v}\right\rangle-\left\langle\nabla_{v} \tilde{\boldsymbol{\alpha}}, \boldsymbol{u}\right\rangle & =\boldsymbol{\nabla}_{u}\langle\tilde{\boldsymbol{\alpha}}, \boldsymbol{v}\rangle-\left\langle\tilde{\boldsymbol{\alpha}}, \nabla_{u} \boldsymbol{v}\right\rangle-\nabla_{v}\langle\tilde{\boldsymbol{\alpha}}, \boldsymbol{u}\rangle+\left\langle\tilde{\boldsymbol{\alpha}}, \nabla_{v} \boldsymbol{u}\right\rangle \\ & =\boldsymbol{\nabla}_{u}\langle\tilde{\boldsymbol{\alpha}}, \boldsymbol{v}\rangle-\nabla_{\boldsymbol{v}}\langle\tilde{\boldsymbol{\alpha}}, \boldsymbol{u}\rangle-\langle\tilde{\boldsymbol{\alpha}},[\boldsymbol{u}, \boldsymbol{v}]\rangle \tag{43.39} \end{align*}uα~,vvα~,u=uα~,vα~,uvvα~,u+α~,vu(43.39)=uα~,vvα~,uα~,[u,v]
This result generalizes to any tensor-valued 1-form S S S\boldsymbol{S}S, giving
(43.40) d S , u v = u S , v v S , u S , [ u , v ] (43.40) d S , u v = u S , v v S , u S , [ u , v ] {:(43.40)(:dS","u^^v:)=grad_(u)(:S","v:)-grad_(v)(:S","u:)-(:S","[u","v]:):}\begin{equation*} \langle d S, u \wedge v\rangle=\nabla_{u}\langle S, v\rangle-\nabla_{v}\langle S, u\rangle-\langle S,[u, v]\rangle \tag{43.40} \end{equation*}(43.40)dS,uv=uS,vvS,uS,[u,v]
For the special case at hand, involving the vector-valued 1-form d w d w dw\boldsymbol{d} \boldsymbol{w}dw, we use d w , u = d w , u = (:dw,u:)=\langle\boldsymbol{d} \boldsymbol{w}, \boldsymbol{u}\rangle=dw,u= u w u w grad_(u)w\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{w}uw and find
d 2 w , u v = u v w v u w [ u , v ] w (43.41) = R ^ ( u , v ) w d 2 w , u v = u v w v u w [ u , v ] w (43.41) = R ^ ( u , v ) w {:[(:d^(2)w,u^^v:)=grad_(u)grad_(v)w-grad_(v)grad_(u)w-grad_([u,v])w],[(43.41)= hat(R)(u","v)w]:}\begin{align*} \left\langle d^{2} w, u \wedge v\right\rangle & =\nabla_{u} \nabla_{v} w-\nabla_{v} \nabla_{u} w-\nabla_{[u, v]} w \\ & =\hat{R}(u, v) w \tag{43.41} \end{align*}d2w,uv=uvwvuw[u,v]w(43.41)=R^(u,v)w
where, as before, R ^ R ^ hat(R)\hat{\boldsymbol{R}}R^ is the covariant Riemann curvature operator [ u , v ] [ u , v ] u , v [ u , v ] [grad_(u),grad_(v)]-grad_([u,v])\left[\boldsymbol{\nabla}_{\boldsymbol{u}}, \boldsymbol{\nabla}_{\boldsymbol{v}}\right]-\boldsymbol{\nabla}_{[\boldsymbol{u}, \boldsymbol{v}]}[u,v][u,v].
In the electromagnetic case we examined in Example 43.3, we had F , u F , u (:F,u^^\langle\boldsymbol{F}, \boldsymbol{u} \wedgeF,u v = F ( u , v ) v = F ( u , v ) v:)=F(u,v)\boldsymbol{v}\rangle=\boldsymbol{F}(\boldsymbol{u}, \boldsymbol{v})v=F(u,v), allowing us to evaluate the surface integral over a closed surface. The Bianchi identity d F ~ = 0 d F ~ = 0 d tilde(F)=0\boldsymbol{d} \tilde{\boldsymbol{F}}=0dF~=0 allowed us to set this to zero. In the same way, we have the equation
(43.42) d 2 w , u v = R ^ ( u , v ) w (43.42) d 2 w , u v = R ^ ( u , v ) w {:(43.42)(:d^(2)w,u^^v:)= hat(R)(u","v)w:}\begin{equation*} \left\langle\boldsymbol{d}^{2} \boldsymbol{w}, \boldsymbol{u} \wedge \boldsymbol{v}\right\rangle=\hat{\boldsymbol{R}}(\boldsymbol{u}, \boldsymbol{v}) \boldsymbol{w} \tag{43.42} \end{equation*}(43.42)d2w,uv=R^(u,v)w
which allows us to do the surface integral, which we know from d R = 0 d R = 0 dR=0\boldsymbol{d} \mathcal{R}=0dR=0, must be zero. This provides a coordinate version of the Bianchi identity. All that's left to do is to rerun the argument in Example 43.3. For simplicity we use Riemann normal coordinates. 18 18 ^(18){ }^{18}18 The outward contribution from the the y z y z yzy zyz face of the cube at x + Δ x x + Δ x x+Delta xx+\Delta xx+Δx is given by
(43.43) R ^ ( e y , e z ) w Δ y Δ z (43.43) R ^ e y , e z w Δ y Δ z {:(43.43) hat(R)(e_(y),e_(z))w Delta y Delta z:}\begin{equation*} \hat{\boldsymbol{R}}\left(\boldsymbol{e}_{y}, \boldsymbol{e}_{z}\right) \boldsymbol{w} \Delta y \Delta z \tag{43.43} \end{equation*}(43.43)R^(ey,ez)wΔyΔz
In coordinates, this is
(43.44) R β y z α ( at x + Δ x ) w β Δ y Δ z (43.44) R β y z α (  at  x + Δ x ) w β Δ y Δ z {:(43.44)R_(beta yz)^(alpha)(" at "x+Delta x)w^(beta)Delta y Delta z:}\begin{equation*} R_{\beta y z}^{\alpha}(\text { at } x+\Delta x) w^{\beta} \Delta y \Delta z \tag{43.44} \end{equation*}(43.44)Rβyzα( at x+Δx)wβΔyΔz
14 14 ^(14){ }^{14}14 The former approach is followed in Misner, Thorne, and Wheeler. You can see how taking w w grad w\boldsymbol{\nabla} \boldsymbol{w}w around the boundary resembles the process in Example 35.5 for obtaining the components of R R R\boldsymbol{R}R. In this case, each loop comprises the face of the infinitesimal cube, and opposite faces are taken in different directions, leading to the derivatives. The details are left as an exercise.
15 15 ^(15){ }^{15}15 As mentioned in Chapter 32, this quantity is equivalent to the inner product d α ~ , u v d α ~ , u v (:d tilde(alpha),u^^v:)\langle\boldsymbol{d} \tilde{\boldsymbol{\alpha}}, \boldsymbol{u} \wedge \boldsymbol{v}\rangledα~,uv.
16 16 ^(16){ }^{16}16 If in doubt, think of d α ~ d α ~ d tilde(alpha)\boldsymbol{d} \tilde{\alpha}dα~ as an object with two slots, which we'll fill with vectors u u u\boldsymbol{u}u and v v v\boldsymbol{v}v in order, via the object u v u v u ox v\boldsymbol{u} \otimes \boldsymbol{v}uv. Putting the vector u u u\boldsymbol{u}u in the first (d) slot gives u d u = u u d u = u u*d-=u*grad=grad_(u)\boldsymbol{u} \cdot \boldsymbol{d} \equiv \boldsymbol{u} \cdot \boldsymbol{\nabla}=\boldsymbol{\nabla}_{\boldsymbol{u}}udu=u. This was verified in Exercise 34.6.
17 17 ^(17){ }^{17}17 We note the recipe of how to take the covariant derivative of a 1 -form like α ~ α ~ tilde(alpha)\tilde{\boldsymbol{\alpha}}α~ is given by considering the inner product u α ~ , w = u α ~ , w + α ~ , u w u α ~ , w = u α ~ , w + α ~ , u w grad_(u)(: tilde(alpha),w:)=(:grad_(u)( tilde(alpha)),w:)+(:( tilde(alpha)),grad_(u)w:)\boldsymbol{\nabla}_{\boldsymbol{u}}\langle\tilde{\boldsymbol{\alpha}}, \boldsymbol{w}\rangle=\left\langle\boldsymbol{\nabla}_{u} \tilde{\boldsymbol{\alpha}}, \boldsymbol{w}\right\rangle+\left\langle\tilde{\boldsymbol{\alpha}}, \boldsymbol{\nabla}_{u} \boldsymbol{w}\right\rangleuα~,w=uα~,w+α~,uw.
18 18 ^(18){ }^{18}18 See Chapter 35 for a description of these. In short, they describe a local inertial frame, so the vanishing connection coefficients at the origin imply that we can treat ordinary derivatives as equivalent to covariant ones in this treatment.
The opposite face gives a similar contribution, except that the sign is reversed and we evaluate at point x x xxx rather than x + Δ x x + Δ x x+Delta xx+\Delta xx+Δx. The combination yields
(43.45) R α β y z x w β Δ x Δ y Δ z . (43.45) R α β y z x w β Δ x Δ y Δ z . {:(43.45)(delR^(alpha)_(beta yz))/(del x)*w^(beta)Delta x Delta y Delta z.:}\begin{equation*} \frac{\partial R^{\alpha}{ }_{\beta y z}}{\partial x} \cdot w^{\beta} \Delta x \Delta y \Delta z . \tag{43.45} \end{equation*}(43.45)RαβyzxwβΔxΔyΔz.
Repeating for all of the other faces, summing and setting equal to zero, we obtain the Bianchi identity for gravitation in component form
(43.46) R β ^ y ^ z ^ , x ^ x ^ + R β ^ z ^ x ^ , y ^ y ^ + R β ^ x ^ y ^ , z ^ α ^ = 0 , (43.46) R β ^ y ^ z ^ , x ^ x ^ + R β ^ z ^ x ^ , y ^ y ^ + R β ^ x ^ y ^ , z ^ α ^ = 0 , {:(43.46)R_( hat(beta) hat(y) hat(z), hat(x))^( hat(x))+R_( hat(beta) hat(z) hat(x), hat(y))^( hat(y))+R_( hat(beta) hat(x) hat(y), hat(z))^( hat(alpha))=0",":}\begin{equation*} R_{\hat{\beta} \hat{y} \hat{z}, \hat{x}}^{\hat{x}}+R_{\hat{\beta} \hat{z} \hat{x}, \hat{y}}^{\hat{y}}+R_{\hat{\beta} \hat{x} \hat{y}, \hat{z}}^{\hat{\alpha}}=0, \tag{43.46} \end{equation*}(43.46)Rβ^y^z^,x^x^+Rβ^z^x^,y^y^+Rβ^x^y^,z^α^=0,
where we've reinstated hats to stress that this is true in the local inertial frame. This valid tensor equation is also true in a general coordinate frame on using the, by now familiar, commas go to semicolons rule. 19 19 ^(19){ }^{19}19 Finally, we can generalize the cube so that it's described by any set of coordinate axes. This gives a final expression for the Bianchi identity of 20 20 ^(20){ }^{20}20
19 19 ^(19){ }^{19}19 That is, we have the curved space time equation
R α β y z ; x + R α β z x ; y + R α β x y ; z = 0 R α β y z ; x + R α β z x ; y + R α β x y ; z = 0 R^(alpha)_(beta yz;x)+R^(alpha)_(beta zx;y)+R^(alpha)_(beta xy;z)=0R^{\alpha}{ }_{\beta y z ; x}+R^{\alpha}{ }_{\beta z x ; y}+R^{\alpha}{ }_{\beta x y ; z}=0Rαβyz;x+Rαβzx;y+Rαβxy;z=0.
20 20 ^(20){ }^{20}20 In tensor notation, we can also write
(43.48) R β [ μ ν ; λ ] α = 0 (43.48) R β [ μ ν ; λ ] α = 0 {:(43.48)R_(beta[mu nu;lambda])^(alpha)=0:}\begin{equation*} R_{\beta[\mu \nu ; \lambda]}^{\alpha}=0 \tag{43.48} \end{equation*}(43.48)Rβ[μν;λ]α=0
21 21 ^(21){ }^{21}21 The components of C C CCC are given by
C μ ν α β = R μ ν α β 1 2 R μ α g ν β C μ ν α β = R μ ν α β 1 2 R μ α g ν β C_(mu nu alpha beta)=R_(mu nu alpha beta)-(1)/(2)R_(mu alpha)g_(nu beta)C_{\mu \nu \alpha \beta}=R_{\mu \nu \alpha \beta}-\frac{1}{2} R_{\mu \alpha} g_{\nu \beta}Cμναβ=Rμναβ12Rμαgνβ + 1 2 R μ β g ν α + 1 2 R ν α g μ β + 1 2 R μ β g ν α + 1 2 R ν α g μ β +(1)/(2)R_(mu beta)g_(nu alpha)+(1)/(2)R_(nu alpha)g_(mu beta)+\frac{1}{2} R_{\mu \beta} g_{\nu \alpha}+\frac{1}{2} R_{\nu \alpha} g_{\mu \beta}+12Rμβgνα+12Rναgμβ 1 2 R ν β g μ α + 1 6 R ( g μ α g ν β 1 2 R ν β g μ α + 1 6 R g μ α g ν β -(1)/(2)R_(nu beta)g_(mu alpha)+(1)/(6)R(g_(mu alpha)g_(nu beta):}-\frac{1}{2} R_{\nu \beta} g_{\mu \alpha}+\frac{1}{6} R\left(g_{\mu \alpha} g_{\nu \beta}\right.12Rνβgμα+16R(gμαgνβ
g μ β g ν α ) g μ β g ν α {:-g_(mu beta)g_(nu alpha))\left.-g_{\mu \beta} g_{\nu \alpha}\right)gμβgνα).
(43.49) R β μ ν ; λ α + R β ν λ ; μ α + R β λ μ ; ν α = 0 . (43.49) R β μ ν ; λ α + R β ν λ ; μ α + R β λ μ ; ν α = 0 . {:(43.49)R_(beta mu nu;lambda)^(alpha)+R_(beta nu lambda;mu)^(alpha)+R_(beta lambda mu;nu)^(alpha)=0.:}\begin{equation*} R_{\beta \mu \nu ; \lambda}^{\alpha}+R_{\beta \nu \lambda ; \mu}^{\alpha}+R_{\beta \lambda \mu ; \nu}^{\alpha}=0 . \tag{43.49} \end{equation*}(43.49)Rβμν;λα+Rβνλ;μα+Rβλμ;να=0.
This is the version of the equation we used in Chapter 13.
This chapter has given us a geometrical sense of where the Bianchi identity comes from, both in electromagnetism and in gravitation. There is one final link that we can make between these theories using the Bianchi identity. It is tempting to identify the Einstein field equation with the Maxwell equations, since these are the key equation of motion for both theories. However, a closer analogy can be made between the Maxwell equations and the Weyl tensor C C C\boldsymbol{C}C that we met in Chapter 35. 21 21 ^(21){ }^{21}21 This is examined in the next example.

Example 43.8

Since the Weyl tensor collects those parts of the Riemann tensor that do not appear in the Einstein field equation, it would seem that C C CCC represents that part of the spacetime curvature that is not generated locally by the matter distribution. The components of the Riemann tensor obey the Bianchi identity, which can be rewritten 22 22 ^(22){ }^{22}22 in terms of the components of C C C\boldsymbol{C}C as
(43.51) C μ ν α β = J μ ν α , (43.51) C μ ν α β = J μ ν α , {:(43.51)C^(mu nu alpha beta)=J^(mu nu alpha)",":}\begin{equation*} C^{\mu \nu \alpha \beta}=J^{\mu \nu \alpha}, \tag{43.51} \end{equation*}(43.51)Cμναβ=Jμνα,
where J μ ν α = R α [ μ ; ν ] + 1 6 g α [ ν R μ μ ] J μ ν α = R α [ μ ; ν ] + 1 6 g α [ ν R μ μ ] J^(mu nu alpha)=R^(alpha[mu;nu])+(1)/(6)g^(alpha[nu)R^(mu mu])J^{\mu \nu \alpha}=R^{\alpha[\mu ; \nu]}+\frac{1}{6} g^{\alpha[\nu} R^{\mu \mu]}Jμνα=Rα[μ;ν]+16gα[νRμμ]. Since eqn 43.51 looks so much like the Maxwell equations F α β i β = J α F α β i β = J α F^(alpha beta)_(i beta)=J^(alpha)F^{\alpha \beta}{ }_{i \beta}=J^{\alpha}Fαβiβ=Jα, we can think of the Bianchi identity as a field equation for the Weyl tensor, telling us how matter determines curvature at other points in spacetime, just as currents determine electromagnetic field lines in other parts of spacetime.
In the next chapter, we turn to another important aspect of classical field theory that plays a major role in gravitation: the notion of a gauge field.

Chapter summary

  • Charge conservation can be written as
(43.52) 0 = V d J = V J (43.52) 0 = V d J = V J {:(43.52)0=int_(V)d***J=int_(delV)***J:}\begin{equation*} 0=\int_{\mathcal{V}} \boldsymbol{d} \star \boldsymbol{J}=\int_{\partial \mathcal{V}} \star \boldsymbol{J} \tag{43.52} \end{equation*}(43.52)0=VdJ=VJ
  • The Bianchi identity for electromagnetism is a component version of the expression d F ~ = 0 d F ~ = 0 d tilde(F)=0\boldsymbol{d} \tilde{\boldsymbol{F}}=0dF~=0. The flat-space component equation reads
(43.53) F α β , γ + F γ α , β + F β γ , α = 0 (43.53) F α β , γ + F γ α , β + F β γ , α = 0 {:(43.53)F_(alpha beta,gamma)+F_(gamma alpha,beta)+F_(beta gamma,alpha)=0:}\begin{equation*} F_{\alpha \beta, \gamma}+F_{\gamma \alpha, \beta}+F_{\beta \gamma, \alpha}=0 \tag{43.53} \end{equation*}(43.53)Fαβ,γ+Fγα,β+Fβγ,α=0
  • The gravitational Bianchi identity can be written as
(43.54) d R = 0 (43.54) d R = 0 {:(43.54)dR=0:}\begin{equation*} \boldsymbol{d} \mathcal{R}=0 \tag{43.54} \end{equation*}(43.54)dR=0
In components, this yields a curved-space equation
(43.55) R β μ ν ; λ α + R β ν λ ; μ α + R β λ μ ; ν α = 0 (43.55) R β μ ν ; λ α + R β ν λ ; μ α + R β λ μ ; ν α = 0 {:(43.55)R_(beta mu nu;lambda)^(alpha)+R_(beta nu lambda;mu)^(alpha)+R_(beta lambda mu;nu)^(alpha)=0:}\begin{equation*} R_{\beta \mu \nu ; \lambda}^{\alpha}+R_{\beta \nu \lambda ; \mu}^{\alpha}+R_{\beta \lambda \mu ; \nu}^{\alpha}=0 \tag{43.55} \end{equation*}(43.55)Rβμν;λα+Rβνλ;μα+Rβλμ;να=0

Exercises

(43.1) Verify that the Bianchi identity can also be written as
(43.56) R [ μ ν ; λ ] α = 0 . (43.56) R [ μ ν ; λ ] α = 0 . {:(43.56)R_([mu nu;lambda])^(alpha)=0.:}\begin{equation*} R_{[\mu \nu ; \lambda]}^{\alpha}=0 . \tag{43.56} \end{equation*}(43.56)R[μν;λ]α=0.
(43.2) Using the contracted Bianchi identity
(43.57) R λ ; α α = 1 2 R ; λ (43.57) R λ ; α α = 1 2 R ; λ {:(43.57)R_(lambda;alpha)^(alpha)=(1)/(2)R_(;lambda):}\begin{equation*} R_{\lambda ; \alpha}^{\alpha}=\frac{1}{2} R_{; \lambda} \tag{43.57} \end{equation*}(43.57)Rλ;αα=12R;λ
that was derived in Chapter 13, verify that eqn 43.51 reduces to (a contraction of) the Bianchi identity.
(43.3) Conservation of the source of gravitational curvature is written as T = 0 T = 0 grad*T=0\boldsymbol{\nabla} \cdot \boldsymbol{T}=0T=0, but it can also be written as
(43.58) d T = 0 . (43.58) d T = 0 . {:(43.58)d***T=0.:}\begin{equation*} d \star T=0 . \tag{43.58} \end{equation*}(43.58)dT=0.
Since the Einstein equations are simply G = 8 π T G = 8 π T G=8pi T\boldsymbol{G}=8 \pi \boldsymbol{T}G=8πT the dual form is
(43.59) G = 8 π T . (43.59) G = 8 π T . {:(43.59)***G=8pi***T.:}\begin{equation*} \star \boldsymbol{G}=8 \pi \star \boldsymbol{T} . \tag{43.59} \end{equation*}(43.59)G=8πT.
We shall find expressions for T T ***T\star \boldsymbol{T}T and G G ***G\star \boldsymbol{G}G.
Take the dual of the vector part of the ( 1 , 1 ) ( 1 , 1 ) (1,1)(1,1)(1,1) version
of T = T μ ν e μ ω ν T = T μ ν e μ ω ν T=T^(mu)_(nu)e_(mu)oxomega^(nu)\boldsymbol{T}=T^{\mu}{ }_{\nu} \boldsymbol{e}_{\mu} \otimes \boldsymbol{\omega}^{\nu}T=Tμνeμων to show that
(43.60) T = T μ ν ( e μ d Σ ν ) (43.60) T = T μ ν e μ d Σ ν {:(43.60)***T=T^(mu nu)(e_(mu)oxdSigma_(nu)):}\begin{equation*} \star \boldsymbol{T}=T^{\mu \nu}\left(\boldsymbol{e}_{\mu} \otimes \mathrm{d} \boldsymbol{\Sigma}_{\nu}\right) \tag{43.60} \end{equation*}(43.60)T=Tμν(eμdΣν)
and
(43.61) G = G μ ν ( e μ d Σ ν ) , (43.61) G = G μ ν e μ d Σ ν , {:(43.61)***G=G^(mu nu)(e_(mu)oxdSigma_(nu))",":}\begin{equation*} \star \boldsymbol{G}=G^{\mu \nu}\left(\boldsymbol{e}_{\mu} \otimes \mathrm{d} \boldsymbol{\Sigma}_{\nu}\right), \tag{43.61} \end{equation*}(43.61)G=Gμν(eμdΣν),
where d Σ ν d Σ ν dSigma_(nu)\mathrm{d} \boldsymbol{\Sigma}_{\nu}dΣν is the surface 3 -form.
(43.4) Recall the identity from Exercise 34.3, valid in a coordinate frame, that
(43.62) Γ β α α = x β ln ( g ) . (43.62) Γ β α α = x β ln ( g ) . {:(43.62)Gamma_(beta alpha)^(alpha)=(del)/(delx^(beta))ln(sqrt(-g)).:}\begin{equation*} \Gamma_{\beta \alpha}^{\alpha}=\frac{\partial}{\partial x^{\beta}} \ln (\sqrt{-g}) . \tag{43.62} \end{equation*}(43.62)Γβαα=xβln(g).
Use this to show that d T = 0 d T = 0 d***T=0d \star T=0dT=0, as we claimed in the previous question.
(43.5) Show that we must have
(43.63) V G = 0 . (43.63) V G = 0 . {:(43.63)int_(delV)***G=0.:}\begin{equation*} \int_{\partial \mathcal{V}} \star \boldsymbol{G}=0 . \tag{43.63} \end{equation*}(43.63)VG=0.
See Misner, Thorne, and Wheeler for a discussion of the physical content of this equation.

44

44.1 Fibre bundles and gauge invariance strength
Chapter summary 480
Exercises
R Gauge theory is very help ful for understanding weak ful for understanding weak gravitational fields (Chap-
ter 45) and gravitational waves ter 45) and gravitational waves
(Chapter 46). It also forms (Chapter 46). It also forms
the basis of high-dimensional the basis of high-dim
theories (Chapter 48).
1 1 ^(1){ }^{1}1 'Gauge' is an unfortunate term in this context, referring originally to the thickness of metal wires and rails, but we are stuck with it. In field theory, it derives from Hermann Weyl's use of the term in general relativity that we discuss in Chapter 45
Fig. 44.1 (a) A representation of the field ρ ( x ) e i θ ( x ) ρ ( x ) e i θ ( x ) rho(x)e^(itheta(x))\rho(x) \mathrm{e}^{\mathrm{i} \theta(x)}ρ(x)eiθ(x), where the angle of each arrow to the vertical represents θ ( x ) θ ( x ) theta(x)\theta(x)θ(x) at a particular point. (b) The effect of a global phase transformation θ ( x ) θ ( x ) theta(x)rarr\theta(x) \rightarrowθ(x) θ ( x ) + α θ ( x ) + α theta(x)+alpha\theta(x)+\alphaθ(x)+α.

Gauge fields

If you leave a thing alone you leave it to a torrent of change.
G. K. Chesterton (1874-1936) Heretics
Suppose you move through some distance in space and you measure a change in a vector. There are two possible reasons for the change: (i) the intrinsic change of the vector field with position and (ii) the change, with position, of the coordinate system you are using. In Chapter 7, these factors led to our definition of the covariant derivative, in words and coordinates respectively, as
( Covariant derivative ) = ( Change in vector ) ( Change due to coordinate system ) (44.1) ( α v ) μ = v μ x α + Γ μ α β v β . (  Covariant   derivative  ) = (  Change   in vector  ) (  Change due to   coordinate system  ) (44.1) α v μ = v μ x α + Γ μ α β v β . {:[((" Covariant ")/(" derivative "))=((" Change ")/(" in vector "))-((" Change due to ")/(" coordinate system "))],[(44.1)(grad_(alpha)v)^(mu)=(delv^(mu))/(delx^(alpha))+Gamma^(mu)_(alpha beta)v^(beta).]:}\begin{align*} \binom{\text { Covariant }}{\text { derivative }} & =\binom{\text { Change }}{\text { in vector }}-\binom{\text { Change due to }}{\text { coordinate system }} \\ \left(\boldsymbol{\nabla}_{\alpha} \boldsymbol{v}\right)^{\mu} & =\frac{\partial v^{\mu}}{\partial x^{\alpha}}+\Gamma^{\mu}{ }_{\alpha \beta} v^{\beta} . \tag{44.1} \end{align*}( Covariant  derivative )=( Change  in vector )( Change due to  coordinate system )(44.1)(αv)μ=vμxα+Γμαβvβ.
This effect of two contributions to the change in a vector with position is closely related to the physics of phases and gauges 1 1 ^(1){ }^{1}1 in field theory. In particular, there is an illuminating analogy between the covariant derivative μ μ grad_(mu)\nabla_{\mu}μ in relativity and the covariant derivative D μ D μ D_(mu)D_{\mu}Dμ used to describe field theories with local symmetries. In this chapter, we explore this link.

44.1 Fibre bundles and gauge invariance

Fields depend on variables like x x xxx, the position in spacetime. We can think of the field ψ ψ psi\psiψ stretching out over all time and space, such that there is a value of ψ ( x ) ψ ( x ) psi(x)\psi(x)ψ(x) at each point in spacetime. A field theory is described by a Lagrangian L L L\mathcal{L}L. If we perform a transformation of the coordinates that feature in the theory and obtain the same value of L L L\mathcal{L}L, then the equations of motion that one derives from the Lagrangian are unchanged, and we say that the theory has a symmetry. In addition to their dependence on coordinate variables like x x xxx, fields can have other, internal variables such as a phase θ θ theta\thetaθ that causes the field carry around a factor e i θ e i θ e^(itheta)\mathrm{e}^{\mathrm{i} \theta}eiθ. An example is a field, ψ ( x ) = ρ ( x ) e i θ ( x ) ψ ( x ) = ρ ( x ) e i θ ( x ) psi(x)=rho(x)e^(itheta(x))\psi(x)=\rho(x) \mathrm{e}^{\mathrm{i} \theta(x)}ψ(x)=ρ(x)eiθ(x) which could be visualized as having an amplitude ρ ρ rho\rhoρ at each point in space, as well as a phase θ θ theta\thetaθ, which could be represented in a diagram by an phasor arrow, as shown in Fig. 44.1(a). We can perform transformations on the phase itself, perhaps by making the change θ ( x ) θ ( x ) + α θ ( x ) θ ( x ) + α theta(x)rarr theta(x)+alpha\theta(x) \rightarrow \theta(x)+\alphaθ(x)θ(x)+α, which amounts to a rotation of all of the arrows by an angle α α alpha\alphaα as shown in Fig. 44.1(b). If
the Lagrangian does not change under this transformation then we say that we have an internal symmetry.
Mathematically speaking, despite being a function of the position in spacetime x x xxx, the phase θ ( x ) θ ( x ) theta(x)\theta(x)θ(x) does not itself live in (3+1)-dimensional spacetime. So the change in the phase does not correspond to any change in the spatial coordinates, which is to say that the rotation of the arrows is a rotation in the internal space of the field. To understand these internal variables, we make use of the mathematical notion of a fibre bundle over spacetime. 2 2 ^(2){ }^{2}2 We imagine that floating above each point in spacetime is another space called a fibre, described by the internal variables. In the mathematical language, we call the usual (3+1)-dimensional spacetime the base space M M M\mathcal{M}M. The fibre bundle B B B\mathcal{B}B is a structure formed from combining a fibre V V V\mathcal{V}V at each point in M M M\mathcal{M}M (or, in Roger Penrose's words, 'an M M M\mathcal{M}M worth of V V V\mathcal{V}V 's'). The bundle B B B\mathcal{B}B can then be thought of as comprising M M M\mathcal{M}M with the structure floating above M M M\mathcal{M}M, as shown in Fig. 44.2.
Example 44.1
  • We have met such a structure before when we considered tangent vectors. A vector v v v\boldsymbol{v}v that is tangent to a curve at a point P P P\mathcal{P}P in a spacetime manifold M M M\mathcal{M}M, lives in a vector space known as a tangent space, denoted T P T P T_(P)\mathcal{T}_{\mathcal{P}}TP that can be thought of as floating above the point. The components of the vector can then be thought of as being stored in a fibre (that is, in the tangent space T P T P T_(P)\mathcal{T}_{\mathcal{P}}TP ) above P P P\mathcal{P}P. There is a different tangent space for each point in the spacetime. Combining all of the tangent spaces, we make a tangent bundle T M T M TM\mathcal{T} \mathcal{M}TM.
  • Another example of bundles in physics is isospin. Many matter fields act as if they have an internal switch that allows us to change the identity of particle excitations at a particular point. This internal variable is isospin and can be treated in terms of fibres.
  • A slightly different example is Newtonian spacetime, which is formed from a one-dimensional manifold R R RRR that corresponds to points in time. Floating above each point in time is a three-dimensional space that gives the configuration of the system in space at that time. 3 3 ^(3){ }^{3}3
In our example of a field ψ ψ psi\psiψ with a phase, it is the phase variable θ θ theta\thetaθ that is stored in a fibre above each point in spacetime. Since we want the field to be single valued, the angle θ θ theta\thetaθ must be the same as θ + 2 π θ + 2 π theta+2pi\theta+2 \piθ+2π, and we can visualize each of the fibres as a unit circle floating above a point in spacetime as shown (for two spatial dimensions) in Fig. 44.3. (This is, of course, equivalent to our picture of arrows in Fig. 44.1, but making the unit circles explicit is helpful for later applications.) The global rotation of the phases by an amount α α alpha\alphaα, simply adds α α alpha\alphaα to the values stored in each of the fibres. If in doubt, think of each unit circle fibre as a knob that can be turned. The global rotation corresponds to twisting all of the knobs by the same angle α α alpha\alphaα.
A very special property of fields is revealed if we examine the transformation of the internal phase variable in detail. Let's confine ourselves to flat space and consider the so-called complex scalar field theory. This theory is built from complex-number-valued fields ψ ψ psi\psiψ that are de-
Fig. 44.2 A one-dimensional manifold M M M\mathcal{M}M has a fibre V V V\mathcal{V}V floating above each point. The combination of an M M M\mathcal{M}M worth of V V V\mathcal{V}V s, makes the bundle B B B\mathcal{B}B.
2 2 ^(2){ }^{2}2 Fibre bundles are described more mathematically in Appendix C.
3 3 ^(3){ }^{3}3 This structure is what general relativity replaces with our notion of metrics defined on a manifold.
Fig. 44.3 View from above of a twodimensional space with a circular fibre (or knob) floating above each point.
scribed by the matter-field Lagrangian
(44.2) L = ( μ ψ ) ( μ ψ ) m 2 ψ ψ (44.2) L = μ ψ μ ψ m 2 ψ ψ {:(44.2)L=-(del^(mu)psi)^(†)(del_(mu)psi)-m^(2)psi^(†)psi:}\begin{equation*} \mathcal{L}=-\left(\partial^{\mu} \psi\right)^{\dagger}\left(\partial_{\mu} \psi\right)-m^{2} \psi^{\dagger} \psi \tag{44.2} \end{equation*}(44.2)L=(μψ)(μψ)m2ψψ
4 4 ^(4){ }^{4}4 To keep our expressions brief and manageable, we use the familiar μ μ del_(mu)\partial_{\mu}μ notation, where
μ = x μ μ = x μ del_(mu)=(del)/(delx^(mu))\partial_{\mu}=\frac{\partial}{\partial x^{\mu}}μ=xμ
We interpret μ μ del^(mu)\partial^{\mu}μ as
μ = η μ ν ν μ = η μ ν ν del^(mu)=eta^(mu nu)del_(nu)\partial^{\mu}=\eta^{\mu \nu} \partial_{\nu}μ=ημνν
5 5 ^(5){ }^{5}5 This is of course because ψ ψ psi^(†)rarr\psi^{\dagger} \rightarrowψ ψ e i α ψ e i α psi^(†)e^(-ialpha)\psi^{\dagger} \mathrm{e}^{-\mathrm{i} \alpha}ψeiα and so the angle α α alpha\alphaα cancels in the combinations of the field and its hermitian conjugate in L L L\mathcal{L}L.
where m m mmm is a constant. 4 4 ^(4){ }^{4}4 This theory has an internal symmetry known as a global phase symmetry, or global U ( 1 ) U ( 1 ) U(1)U(1)U(1) symmetry. This is shown if we make the replacement
(44.3) ψ ( x ) ψ ( x ) e i α (44.3) ψ ( x ) ψ ( x ) e i α {:(44.3)psi(x)rarr psi(x)e^(ialpha):}\begin{equation*} \psi(x) \rightarrow \psi(x) \mathrm{e}^{\mathrm{i} \alpha} \tag{44.3} \end{equation*}(44.3)ψ(x)ψ(x)eiα
since then the Lagrangian L L L\mathcal{L}L (and hence the equation of motion) does not change. 5 5 ^(5){ }^{5}5 The transformation in eqn 44.3 is a global transformation in that the phase changes by the same amount (i.e. an addition of α α alpha\alphaα ) at every point in spacetime.
We can now ask an interesting question: what if we attempt to change the phase by different amounts at each point in spacetime? This would imply we change the phases θ ( x ) θ ( x ) theta(x)\theta(x)θ(x) by an amount α ( x ) α ( x ) alpha(x)\alpha(x)α(x), that is, by an amount that depends in some arbitrary (but smoothly varying) manner on position in spacetime. This is known as a local transformation.
Example 44.2
Let's see how this unfolds. We have
μ ψ ( x ) μ ψ ( x ) e i α ( x ) = e i α ( x ) μ ψ ( x ) + ψ ( x ) e i α ( x ) i μ α ( x ) (44.4) = e i α ( x ) { μ + i [ μ α ( x ) ] } ψ ( x ) μ ψ ( x ) μ ψ ( x ) e i α ( x ) = e i α ( x ) μ ψ ( x ) + ψ ( x ) e i α ( x ) i μ α ( x ) (44.4) = e i α ( x ) μ + i μ α ( x ) ψ ( x ) {:[del_(mu)psi(x) rarrdel_(mu)psi(x)e^(ialpha(x))],[=e^(ialpha(x))del_(mu)psi(x)+psi(x)e^(ialpha(x))idel_(mu)alpha(x)],[(44.4)=e^(ialpha(x)){del_(mu)+i[del_(mu)alpha(x)]}psi(x)]:}\begin{align*} \partial_{\mu} \psi(x) & \rightarrow \partial_{\mu} \psi(x) \mathrm{e}^{\mathrm{i} \alpha(x)} \\ & =\mathrm{e}^{\mathrm{i} \alpha(x)} \partial_{\mu} \psi(x)+\psi(x) \mathrm{e}^{\mathrm{i} \alpha(x)} \mathrm{i} \partial_{\mu} \alpha(x) \\ & =\mathrm{e}^{\mathrm{i} \alpha(x)}\left\{\partial_{\mu}+\mathrm{i}\left[\partial_{\mu} \alpha(x)\right]\right\} \psi(x) \tag{44.4} \end{align*}μψ(x)μψ(x)eiα(x)=eiα(x)μψ(x)+ψ(x)eiα(x)iμα(x)(44.4)=eiα(x){μ+i[μα(x)]}ψ(x)
Similarly,
(44.5) μ ψ ( x ) e i α ( x ) { μ i [ μ α ( x ) ] } ψ ( x ) (44.5) μ ψ ( x ) e i α ( x ) μ i μ α ( x ) ψ ( x ) {:(44.5)del^(mu)psi^(†)(x)rarre^(-ialpha(x)){del^(mu)-i[del^(mu)alpha(x)]}psi^(†)(x):}\begin{equation*} \partial^{\mu} \psi^{\dagger}(x) \rightarrow \mathrm{e}^{-\mathrm{i} \alpha(x)}\left\{\partial^{\mu}-\mathrm{i}\left[\partial^{\mu} \alpha(x)\right]\right\} \psi^{\dagger}(x) \tag{44.5} \end{equation*}(44.5)μψ(x)eiα(x){μi[μα(x)]}ψ(x)
The first term in the Lagrangian becomes
(44.6) ( μ ψ ) ( μ ψ ) i ψ ( μ α ) ( μ ψ ) + i ψ ( μ ψ ) ( μ α ) + ψ ψ ( μ α ) ( μ α ) (44.6) μ ψ μ ψ i ψ μ α μ ψ + i ψ μ ψ μ α + ψ ψ μ α μ α {:(44.6)(del^(mu)psi^(†))(del_(mu)psi)-ipsi^(†)(del^(mu)alpha)(del_(mu)psi)+ipsi(del^(mu)psi^(†))(del_(mu)alpha)+psi^(†)psi(del^(mu)alpha)(del_(mu)alpha):}\begin{equation*} \left(\partial^{\mu} \psi^{\dagger}\right)\left(\partial_{\mu} \psi\right)-\mathrm{i} \psi^{\dagger}\left(\partial^{\mu} \alpha\right)\left(\partial_{\mu} \psi\right)+\mathrm{i} \psi\left(\partial^{\mu} \psi^{\dagger}\right)\left(\partial_{\mu} \alpha\right)+\psi^{\dagger} \psi\left(\partial^{\mu} \alpha\right)\left(\partial_{\mu} \alpha\right) \tag{44.6} \end{equation*}(44.6)(μψ)(μψ)iψ(μα)(μψ)+iψ(μψ)(μα)+ψψ(μα)(μα)
which is not what we started with.
The message is that, perhaps unsurprisingly, the theory is not invariant with respect to local phase transformations. However, we can fix things to guarantee local invariance, by adding another field into the mix. To that end we introduce a 1-form field A ~ ( x ) = A μ ( x ) d x μ A ~ ( x ) = A μ ( x ) d x μ tilde(A)(x)=A_(mu)(x)dx^(mu)\tilde{\boldsymbol{A}}(x)=A_{\mu}(x) \boldsymbol{d} x^{\mu}A~(x)=Aμ(x)dxμ. This field, whose job is to cancel out the effect of the change in internal variable α ( x ) α ( x ) alpha(x)\alpha(x)α(x) with position, is known as a gauge field. This enters into a new covariant field derivative D μ D μ D_(mu)D_{\mu}Dμ, defined by
(44.7) D μ ψ ( x ) = μ ψ ( x ) + i q A μ ( x ) ψ ( x ) (44.7) D μ ψ ( x ) = μ ψ ( x ) + i q A μ ( x ) ψ ( x ) {:(44.7)D_(mu)psi(x)=del_(mu)psi(x)+iqA_(mu)(x)psi(x):}\begin{equation*} D_{\mu} \psi(x)=\partial_{\mu} \psi(x)+\mathrm{i} q A_{\mu}(x) \psi(x) \tag{44.7} \end{equation*}(44.7)Dμψ(x)=μψ(x)+iqAμ(x)ψ(x)
We define the gauge field such that when we change the phase by angle α ( x ) α ( x ) alpha(x)\alpha(x)α(x), the components of the gauge field transform according to
(44.8) A μ ( x ) A μ ( x ) 1 q μ α ( x ) (44.8) A μ ( x ) A μ ( x ) 1 q μ α ( x ) {:(44.8)A_(mu)(x)rarrA_(mu)(x)-(1)/(q)del_(mu)alpha(x):}\begin{equation*} A_{\mu}(x) \rightarrow A_{\mu}(x)-\frac{1}{q} \partial_{\mu} \alpha(x) \tag{44.8} \end{equation*}(44.8)Aμ(x)Aμ(x)1qμα(x)
Let's see how things unfold now.
Example 44.3
If ψ ( x ) ψ ( x ) e i α ( x ) ψ ( x ) ψ ( x ) e i α ( x ) psi(x)rarr psi(x)e^(ialpha(x))\psi(x) \rightarrow \psi(x) \mathrm{e}^{\mathrm{i} \alpha(x)}ψ(x)ψ(x)eiα(x), then μ ψ e i α ( μ ψ ) + i ψ e i α ( μ α ) μ ψ e i α μ ψ + i ψ e i α μ α del_(mu)psi rarre^(ialpha)(del_(mu)psi)+ipsie^(ialpha)(del_(mu)alpha)\partial_{\mu} \psi \rightarrow \mathrm{e}^{\mathrm{i} \alpha}\left(\partial_{\mu} \psi\right)+\mathrm{i} \psi \mathrm{e}^{\mathrm{i} \alpha}\left(\partial_{\mu} \alpha\right)μψeiα(μψ)+iψeiα(μα) and so
D μ ψ = ( μ + i q A μ ) ψ e i α ( μ ψ ) + i ψ e i α ( μ α ) + i q A μ ψ e i α i ψ e i α ( μ α ) D μ ψ = μ + i q A μ ψ e i α μ ψ + i ψ e i α μ α + i q A μ ψ e i α i ψ e i α μ α D_(mu)psi=(del_(mu)+iqA_(mu))psi rarre^(ialpha)(del_(mu)psi)+ipsie^(ialpha)(del_(mu)alpha)+iqA_(mu)psie^(ialpha)-ipsie^(ialpha)(del_(mu)alpha)D_{\mu} \psi=\left(\partial_{\mu}+\mathrm{i} q A_{\mu}\right) \psi \rightarrow \mathrm{e}^{\mathrm{i} \alpha}\left(\partial_{\mu} \psi\right)+\mathrm{i} \psi \mathrm{e}^{\mathrm{i} \alpha}\left(\partial_{\mu} \alpha\right)+\mathrm{i} q A_{\mu} \psi \mathrm{e}^{\mathrm{i} \alpha}-\mathrm{i} \psi \mathrm{e}^{\mathrm{i} \alpha}\left(\partial_{\mu} \alpha\right)Dμψ=(μ+iqAμ)ψeiα(μψ)+iψeiα(μα)+iqAμψeiαiψeiα(μα)
(44.9) = e i α D μ ψ (44.9) = e i α D μ ψ {:(44.9)=e^(ialpha)D_(mu)psi:}\begin{equation*} =\mathrm{e}^{\mathrm{i} \alpha} D_{\mu} \psi \tag{44.9} \end{equation*}(44.9)=eiαDμψ
This property makes the whole Lagrangian invariant if we replace ordinary derivatives by covariant ones
(44.10) L = ( D μ ψ ) ( D μ ψ ) m 2 ψ ψ (44.10) L = D μ ψ D μ ψ m 2 ψ ψ {:(44.10)L=-(D^(mu)psi)^(†)(D_(mu)psi)-m^(2)psi^(†)psi:}\begin{equation*} \mathcal{L}=-\left(D^{\mu} \psi\right)^{\dagger}\left(D_{\mu} \psi\right)-m^{2} \psi^{\dagger} \psi \tag{44.10} \end{equation*}(44.10)L=(Dμψ)(Dμψ)m2ψψ
since now with D μ ψ e i α D μ ψ D μ ψ e i α D μ ψ D_(mu)psi rarre^(ialpha)D_(mu)psiD_{\mu} \psi \rightarrow \mathrm{e}^{\mathrm{i} \alpha} D_{\mu} \psiDμψeiαDμψ, the first term is invariant.
The message is that if the phase α ( x ) α ( x ) alpha(x)\alpha(x)α(x) is a function of x x xxx then, in order to guarantee local phase invariance, a new 1-form gauge field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x) with components A μ ( x ) A μ ( x ) A_(mu)(x)A_{\mu}(x)Aμ(x) is required to build the covariant field derivative. Furthermore, we recognize this field as akin to the electromagnetic gauge field introduced in Chapter 42, where we saw that one of its most intriguing features was that, without changing the Maxwell equations, it could be changed by an arbitrary amount
(44.11) A μ ( x ) A μ ( x ) μ χ ( x ) (44.11) A μ ( x ) A μ ( x ) μ χ ( x ) {:(44.11)A_(mu)(x)rarrA_(mu)(x)-del_(mu)chi(x):}\begin{equation*} A_{\mu}(x) \rightarrow A_{\mu}(x)-\partial_{\mu} \chi(x) \tag{44.11} \end{equation*}(44.11)Aμ(x)Aμ(x)μχ(x)
which known as a gauge transformation. We see from eqn 44.8 that if we identify χ ( x ) χ ( x ) chi(x)\chi(x)χ(x) with α ( x ) / q α ( x ) / q alpha(x)//q\alpha(x) / qα(x)/q, then the transformation demanded in eqn 44.8 is simply gauge invariance, confirming that A A A\boldsymbol{A}A has the usual properties of a gauge field as defined in electromagnetism.
To summarize, the complex scalar field theory can be made to have a local phase symmetry [that is, an invariance of the Lagrangian under a transformation θ ( x ) θ ( x ) + α ( x ) θ ( x ) θ ( x ) + α ( x ) theta(x)rarr theta(x)+alpha(x)\theta(x) \rightarrow \theta(x)+\alpha(x)θ(x)θ(x)+α(x) ] by introducing a gauge field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x) to build a covariant derivative with components D μ = μ + i q A μ D μ = μ + i q A μ D_(mu)=del_(mu)+iqA_(mu)D_{\mu}=\partial_{\mu}+\mathrm{i} q A_{\mu}Dμ=μ+iqAμ. The theory, made locally phase invariant by replacing μ D μ μ D μ del_(mu)rarrD_(mu)\partial_{\mu} \rightarrow D_{\mu}μDμ in the Lagrangian, is known as a gauge theory. The gauge field transforms via a gauge transformation A μ A μ μ α / q A μ A μ μ α / q A_(mu)rarrA_(mu)-del_(mu)alpha//qA_{\mu} \rightarrow A_{\mu}-\partial_{\mu} \alpha / qAμAμμα/q. Gauge theories are the basis of particle physics as well as the physics of superconductivity and topological states of matter like the fractional quantum Hall fluid. 6 6 ^(6){ }^{6}6
In terms of our picture of the fibre bundle, the local transformation θ ( x ) θ ( x ) + α ( x ) θ ( x ) θ ( x ) + α ( x ) theta(x)rarr theta(x)+alpha(x)\theta(x) \rightarrow \theta(x)+\alpha(x)θ(x)θ(x)+α(x) corresponds to a shift in the value of the phase stored in the fibre floating above point x x xxx. Since the shifts are different on different fibres, the resulting bundle is often called a strained bundle, in the same way that a strained solid has atoms displaced by different amounts as a function of position. The derivative μ μ del_(mu)\partial_{\mu}μ, connects the different fibres in that it forces us to consider differences between variables stored in different fibres. In order to take account of the strain in the bundle that we have produced, we say that we introduce a strain into the connection via the field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x). This all sounds vaguely reminiscent of the connection coefficients in geometry. In fact, a further insight into gauge fields is revealed if we compare the covariant derivative from relativity μ μ grad_(mu)\nabla_{\mu}μ with the covariant field derivative D μ D μ D_(mu)D_{\mu}Dμ, which is our next subject for discussion.
6 6 ^(6){ }^{6}6 In the case we have examined, if we interpret q q qqq as the electric charge, A ~ A ~ tilde(A)\tilde{\boldsymbol{A}}A~ is the electromagnetic field.
7 7 ^(7){ }^{7}7 We read off that, for the geometric version in eqn 44.15, we need some ex tra indices to keep track of which component of the vector we are examining. However, we can see that there is an analogy if we take i q A ν ( x ) i q A ν ( x ) iqA_(nu)(x)\mathrm{i} q A_{\nu}(x)iqAν(x) to be equivalent to Γ μ ν β ( x ) Γ μ ν β ( x ) Gamma^(mu)_(nu beta)(x)\Gamma^{\mu}{ }_{\nu \beta}(x)Γμνβ(x).

44.2 Parallel transport and field strength

Let's revisit the covariant derivative μ μ grad_(mu)\nabla_{\mu}μ and the parallel transport that it allows from the point of view of fields. Move from x x xxx to x + d x x + d x x+dxx+\mathrm{d} xx+dx and a field ψ ψ psi\psiψ changes by an amount d ψ d ψ dpsi\mathrm{d} \psidψ. The way to make sense of the change is to note that ψ ( x ) ψ ( x ) psi(x)\psi(x)ψ(x) and ψ ( x + d x ) ψ ( x + d x ) psi(x+dx)\psi(x+\mathrm{d} x)ψ(x+dx) are measured in different coordinate systems. Thus, the apparently innocent equation
(44.12) d ψ = ψ ( x + d x ) ψ ( x ) (44.12) d ψ = ψ ( x + d x ) ψ ( x ) {:(44.12)dpsi=psi(x+dx)-psi(x):}\begin{equation*} \mathrm{d} \psi=\psi(x+\mathrm{d} x)-\psi(x) \tag{44.12} \end{equation*}(44.12)dψ=ψ(x+dx)ψ(x)
that describes the total change in the field in moving through a distance d x d x dx\mathrm{d} xdx, carries information about two coordinate systems.
We want to isolate properties that change due to changes in the internal space of the field as the field is moved around. To do this we make use of parallel transport. In the geometric description of Chapter 7, we made sense of the covariant derivative conceptually as
( Covariant derivative of v ) u = ( total change in vector v ) ( change due to coordinate system ) (  Covariant   derivative of  v ) u = (  total change   in vector  v ) (  change due to   coordinate system  ) ((" Covariant ")/(" derivative of "v))_(u)=((" total change ")/(" in vector "v))-((" change due to ")/(" coordinate system "))\binom{\text { Covariant }}{\text { derivative of } \boldsymbol{v}}_{\boldsymbol{u}}=\binom{\text { total change }}{\text { in vector } \boldsymbol{v}}-\binom{\text { change due to }}{\text { coordinate system }}( Covariant  derivative of v)u=( total change  in vector v)( change due to  coordinate system ).
The condition for the parallel transport of a vector v v v\boldsymbol{v}v along a path whose tangent vector is u u u\boldsymbol{u}u, is that u v = 0 u v = 0 grad_(u)v=0\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{v}=0uv=0, which is to say that the change in the vector components ( v μ / x ν ) v μ / x ν (delv^(mu)//delx^(nu))\left(\partial v^{\mu} / \partial x^{\nu}\right)(vμ/xν) reflect only the change in the coordinate system (which changes by an amount measured by Γ μ ν α v α Γ μ ν α v α -Gamma^(mu)_(nu alpha)v^(alpha)-\Gamma^{\mu}{ }_{\nu \alpha} v^{\alpha}Γμναvα ).
Example 44.4
We write u v = 0 u v = 0 grad_(u)v=0\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{v}=0uv=0 in components as
(44.14) u ν v μ x ν + Γ ν β μ v β u ν = 0 (44.14) u ν v μ x ν + Γ ν β μ v β u ν = 0 {:(44.14)u^(nu)(delv^(mu))/(delx^(nu))+Gamma_(nu beta)^(mu)v^(beta)u^(nu)=0:}\begin{equation*} u^{\nu} \frac{\partial v^{\mu}}{\partial x^{\nu}}+\Gamma_{\nu \beta}^{\mu} v^{\beta} u^{\nu}=0 \tag{44.14} \end{equation*}(44.14)uνvμxν+Γνβμvβuν=0
Cancelling u ν u ν u^(nu)u^{\nu}uν, we see that the change in the vector components on parallel transport δ v μ δ v μ deltav^(mu)\delta v^{\mu}δvμ is given by
(44.15) δ v μ = Γ ν β μ v β d x ν (44.15) δ v μ = Γ ν β μ v β d x ν {:(44.15)deltav^(mu)=-Gamma_(nu beta)^(mu)v^(beta)dx^(nu):}\begin{equation*} \delta v^{\mu}=-\Gamma_{\nu \beta}^{\mu} v^{\beta} \mathrm{d} x^{\nu} \tag{44.15} \end{equation*}(44.15)δvμ=Γνβμvβdxν
The change in the components of the vector field are given by the connection coefficients, which encode how the coordinates change as we move around spacetime
In the same way that geometrical parallel transport allows us access to the change in the vector component δ v μ δ v μ deltav^(mu)\delta v^{\mu}δvμ due to the change in underlying coordinates, parallel transport also allows us access to the change in the scalar field δ ψ δ ψ delta psi\delta \psiδψ due only to the effect of the change in coordinate system on the field's internal coordinates. For the complex scalar field theory described above, we write the change in parallel transport as 7 7 ^(7){ }^{7}7
(44.16) δ ψ ( x ) = i q A ν ( x ) ψ ( x ) d x ν (44.16) δ ψ ( x ) = i q A ν ( x ) ψ ( x ) d x ν {:(44.16)delta psi(x)=-iqA_(nu)(x)psi(x)dx^(nu):}\begin{equation*} \delta \psi(x)=-\mathrm{i} q A_{\nu}(x) \psi(x) \mathrm{d} x^{\nu} \tag{44.16} \end{equation*}(44.16)δψ(x)=iqAν(x)ψ(x)dxν
We therefore arrive at a general description of the covariant derivative as evaluating d ψ d ψ dpsi\mathrm{d} \psidψ, the total change in the scalar field on translation minus
δ ψ δ ψ delta psi\delta \psiδψ, the change due to the effects of the change in coordinate system on the internal variables of the field. In words and symbols
D ψ = ( Change in field ) ( Change due to coordinate system ) = d ψ δ ψ (44.17) = d ψ + i q A μ ψ d x μ . D ψ = (  Change   in field  ) (  Change due to   coordinate system  ) = d ψ δ ψ (44.17) = d ψ + i q A μ ψ d x μ . {:[D psi=((" Change ")/(" in field "))-((" Change due to ")/(" coordinate system "))],[=dpsi-delta psi],[(44.17)=dpsi+iqA_(mu)psidx^(mu).]:}\begin{align*} D \psi & =\binom{\text { Change }}{\text { in field }}-\binom{\text { Change due to }}{\text { coordinate system }} \\ & =\mathrm{d} \psi-\delta \psi \\ & =\mathrm{d} \psi+\mathrm{i} q A_{\mu} \psi \mathrm{d} x^{\mu} . \tag{44.17} \end{align*}Dψ=( Change  in field )( Change due to  coordinate system )=dψδψ(44.17)=dψ+iqAμψdxμ.
We then have a general covariant derivative fit for a description of internal variables of
(44.18) D ψ d x μ D μ ψ = μ ψ + i q A μ ψ (44.18) D ψ d x μ D μ ψ = μ ψ + i q A μ ψ {:(44.18)(D psi)/((d)x^(mu))-=D_(mu)psi=del_(mu)psi+iqA_(mu)psi:}\begin{equation*} \frac{D \psi}{\mathrm{~d} x^{\mu}} \equiv D_{\mu} \psi=\partial_{\mu} \psi+\mathrm{i} q A_{\mu} \psi \tag{44.18} \end{equation*}(44.18)Dψ dxμDμψ=μψ+iqAμψ
Comparing this to the geometric version, we see how the components of the gauge field A ~ A ~ tilde(A)\tilde{\boldsymbol{A}}A~ play the role of the connections Γ μ α β Γ μ α β Gamma^(mu)_(alpha beta)\Gamma^{\mu}{ }_{\alpha \beta}Γμαβ.
Example 44.5
Returning once more to the fibre bundle picture, the idea of parallel transport can be generalized. As usual, we picture the fibres floating above the base space. Instead of parallel transport, we now ask that a quantity found in a fibre that doesn't change
as we move through spacetime, be connected by a horizontal curve as shown in as we move through spacetime, be connected by a horizontal curve as shown in
Fig. 44.4. We therefore have a notion of horizontal transport, expressed mathematically by saying that for horizontal transport D μ ψ = 0 D μ ψ = 0 D_(mu)psi=0D_{\mu} \psi=0Dμψ=0, or μ ψ = i q A μ μ ψ = i q A μ del_(mu)psi=-iqA_(mu)\partial_{\mu} \psi=-\mathrm{i} q A_{\mu}μψ=iqAμ.
Example 44.6
One use of parallel transport in Chapter 11 was to parallel transport a vector around a loop. This resulted in a measure of the curvature of spacetime. Now that we have a general prescription for the covariant derivative of a gauge field ψ ψ psi\psiψ, we will try it out by transporting the field ψ ψ psi\psiψ around a closed infinitesimal quadrilateral loop A B C D D A B C D D ABCDD\mathcal{A B C D} \mathcal{D}ABCDD and see what happens. Since the field is single valued, the effect of the change in the field d ψ d ψ dpsi\mathrm{d} \psidψ does not contribute, and we are be left with the change equivalent to the parallel transport of the field around the loop. Expanding to second order, the change in the field in moving from A A A\mathcal{A}A to B B B\mathcal{B}B, along a line element Δ x μ Δ x μ Deltax^(mu)\Delta x^{\mu}Δxμ is found by considering
ψ ( x B ) = ψ ( x A ) + D μ ψ ( x A ) Δ x μ + 1 2 D μ D ν ψ ( x A ) Δ x μ Δ x ν (44.19) = ( 1 + Δ x μ D μ + 1 2 Δ x μ Δ x ν D μ D ν ) ψ ( x A ) ψ x B = ψ x A + D μ ψ x A Δ x μ + 1 2 D μ D ν ψ x A Δ x μ Δ x ν (44.19) = 1 + Δ x μ D μ + 1 2 Δ x μ Δ x ν D μ D ν ψ x A {:[psi(x_(B))=psi(x_(A))+D_(mu)psi(x_(A))Deltax^(mu)+(1)/(2)D_(mu)D_(nu)psi(x_(A))Deltax^(mu)Deltax^(nu)],[(44.19)=(1+Deltax^(mu)D_(mu)+(1)/(2)*Deltax^(mu)Deltax^(nu)D_(mu)D_(nu))psi(x_(A))]:}\begin{align*} \psi\left(x_{\mathcal{B}}\right) & =\psi\left(x_{\mathcal{A}}\right)+D_{\mu} \psi\left(x_{\mathcal{A}}\right) \Delta x^{\mu}+\frac{1}{2} D_{\mu} D_{\nu} \psi\left(x_{\mathcal{A}}\right) \Delta x^{\mu} \Delta x^{\nu} \\ & =\left(1+\Delta x^{\mu} D_{\mu}+\frac{1}{2} \cdot \Delta x^{\mu} \Delta x^{\nu} D_{\mu} D_{\nu}\right) \psi\left(x_{\mathcal{A}}\right) \tag{44.19} \end{align*}ψ(xB)=ψ(xA)+Dμψ(xA)Δxμ+12DμDνψ(xA)ΔxμΔxν(44.19)=(1+ΔxμDμ+12ΔxμΔxνDμDν)ψ(xA)
We agree to keep all terms to second order in going around the rest of the loop. The process is then repeated for the line section leading from points B B B\mathcal{B}B to C C C\mathcal{C}C, which has length δ x μ δ x μ deltax^(mu)\delta x^{\mu}δxμ. Denoting ψ ( x B ) ψ x B psi(x_(B))\psi\left(x_{\mathcal{B}}\right)ψ(xB) as ψ B ψ B psi_(B)\psi_{\mathcal{B}}ψB we write
ψ C = ( 1 + δ x μ D μ + 1 2 δ x μ δ x ν D μ D ν ) ψ B = [ 1 + ( δ x μ + Δ x μ ) D μ + ( 1 2 δ x μ δ x ν + δ x μ Δ x ν + 1 2 Δ x μ Δ x ν ) D μ D ν ] ψ A . ] ψ C = 1 + δ x μ D μ + 1 2 δ x μ δ x ν D μ D ν ψ B = 1 + δ x μ + Δ x μ D μ + 1 2 δ x μ δ x ν + δ x μ Δ x ν + 1 2 Δ x μ Δ x ν D μ D ν ψ A . {:[psi_(C)=(1+deltax^(mu)D_(mu)+(1)/(2)deltax^(mu)deltax^(nu)D_(mu)D_(nu))psi_(B)],[{:=[1+(deltax^(mu)+Deltax^(mu))D_(mu)+((1)/(2)deltax^(mu)deltax^(nu)+deltax^(mu)Deltax^(nu)+(1)/(2)Deltax^(mu)Deltax^(nu))D_(mu)D_(nu)]psi_(A).]]:}\begin{aligned} \psi_{\mathcal{C}} & =\left(1+\delta x^{\mu} D_{\mu}+\frac{1}{2} \delta x^{\mu} \delta x^{\nu} D_{\mu} D_{\nu}\right) \psi_{\mathcal{B}} \\ & \left.=\left[1+\left(\delta x^{\mu}+\Delta x^{\mu}\right) D_{\mu}+\left(\frac{1}{2} \delta x^{\mu} \delta x^{\nu}+\delta x^{\mu} \Delta x^{\nu}+\frac{1}{2} \Delta x^{\mu} \Delta x^{\nu}\right) D_{\mu} D_{\nu}\right] \psi_{\mathcal{A}} .\right] \end{aligned}ψC=(1+δxμDμ+12δxμδxνDμDν)ψB=[1+(δxμ+Δxμ)Dμ+(12δxμδxν+δxμΔxν+12ΔxμΔxν)DμDν]ψA.]

M M longrightarrowM\longrightarrow \mathcal{M}M
Fig. 44.4 The bundle B B B\mathcal{B}B floating above the manifold M M M\mathcal{M}M. The analogue of parallel transport is horizontal transport through B B B\mathcal{B}B.
Moving now from points C C C\mathcal{C}C to D D D\mathcal{D}D, assumed to have separation Δ x μ Δ x μ Deltax^(mu)\Delta x^{\mu}Δxμ, we have
ψ D = ( 1 Δ x μ D μ + 1 2 Δ x μ Δ x ν D μ D ν ) ψ C = [ 1 + δ x μ D μ + 1 2 δ x μ δ x ν D μ D ν + ( δ x μ Δ x ν Δ x μ δ x ν ) D μ D ν ] ψ A ψ D = 1 Δ x μ D μ + 1 2 Δ x μ Δ x ν D μ D ν ψ C = 1 + δ x μ D μ + 1 2 δ x μ δ x ν D μ D ν + δ x μ Δ x ν Δ x μ δ x ν D μ D ν ψ A {:[psi_(D)=(1-Deltax^(mu)D_(mu)+(1)/(2)Deltax^(mu)Deltax^(nu)D_(mu)D_(nu))psi_(C)],[=[1+deltax^(mu)D_(mu)+(1)/(2)deltax^(mu)deltax^(nu)D_(mu)D_(nu)+(deltax^(mu)Deltax^(nu)-Deltax^(mu)deltax^(nu))D_(mu)D_(nu)]psi_(A)]:}\begin{aligned} \psi_{\mathcal{D}} & =\left(1-\Delta x^{\mu} D_{\mu}+\frac{1}{2} \Delta x^{\mu} \Delta x^{\nu} D_{\mu} D_{\nu}\right) \psi_{\mathcal{C}} \\ & =\left[1+\delta x^{\mu} D_{\mu}+\frac{1}{2} \delta x^{\mu} \delta x^{\nu} D_{\mu} D_{\nu}+\left(\delta x^{\mu} \Delta x^{\nu}-\Delta x^{\mu} \delta x^{\nu}\right) D_{\mu} D_{\nu}\right] \psi_{\mathcal{A}} \end{aligned}ψD=(1ΔxμDμ+12ΔxμΔxνDμDν)ψC=[1+δxμDμ+12δxμδxνDμDν+(δxμΔxνΔxμδxν)DμDν]ψA
and finally back to A A A\mathcal{A}A, with the result that
(44.20) ψ A = ( 1 + δ x μ Δ x ν [ D μ , D ν ] ) ψ A (44.20) ψ A = 1 + δ x μ Δ x ν D μ , D ν ψ A {:(44.20)psi_(A)^(')=(1+deltax^(mu)Deltax^(nu)[D_(mu),D_(nu)])psi_(A):}\begin{equation*} \psi_{\mathcal{A}}^{\prime}=\left(1+\delta x^{\mu} \Delta x^{\nu}\left[D_{\mu}, D_{\nu}\right]\right) \psi_{\mathcal{A}} \tag{44.20} \end{equation*}(44.20)ψA=(1+δxμΔxν[Dμ,Dν])ψA
where the commutator [ D μ , D ν ] = D μ D ν D μ D ν D μ , D ν = D μ D ν D μ D ν [D_(mu),D_(nu)]=D_(mu)D_(nu)-D_(mu)D_(nu)\left[D_{\mu}, D_{\nu}\right]=D_{\mu} D_{\nu}-D_{\mu} D_{\nu}[Dμ,Dν]=DμDνDμDν. Notice that the area of the loop is δ x μ Δ x ν δ x μ Δ x ν deltax^(mu)Deltax^(nu)\delta x^{\mu} \Delta x^{\nu}δxμΔxν and so the change in the field is given by the action of [ D μ , D ν ] D μ , D ν [D_(mu),D_(nu)]\left[D_{\mu}, D_{\nu}\right][Dμ,Dν] on the field, multiplied by the area of the loop.
It is striking and significant that the parallel transport around an infinitesimal loop has resulted in a change determined by the action of a linear operator [ D μ , D ν D μ , D ν [D_(mu),D_(nu):}\left[D_{\mu}, D_{\nu}\right.[Dμ,Dν ] on the field ψ ψ psi\psiψ. This commutator is often given the name the gauge field strength, or
(44.21) Q μ ν = [ D μ , D ν ] (44.21) Q μ ν = D μ , D ν {:(44.21)Q_(mu nu)=[D_(mu),D_(nu)]:}\begin{equation*} Q_{\mu \nu}=\left[D_{\mu}, D_{\nu}\right] \tag{44.21} \end{equation*}(44.21)Qμν=[Dμ,Dν]
Example 44.7
For our example of the complex scalar field, the gauge field strength is
(44.22) Q μ ν = [ D μ , D ν ] = [ ( μ + i q A μ ) , ( ν + i q A ν ) ] . (44.22) Q μ ν = D μ , D ν = μ + i q A μ , ν + i q A ν . {:(44.22)Q_(mu nu)=[D_(mu),D_(nu)]=[(del_(mu)+iqA_(mu)),(del_(nu)+iqA_(nu))].:}\begin{equation*} Q_{\mu \nu}=\left[D_{\mu}, D_{\nu}\right]=\left[\left(\partial_{\mu}+\mathrm{i} q A_{\mu}\right),\left(\partial_{\nu}+\mathrm{i} q A_{\nu}\right)\right] . \tag{44.22} \end{equation*}(44.22)Qμν=[Dμ,Dν]=[(μ+iqAμ),(ν+iqAν)].
We obtain
(44.23) [ D μ , D ν ] = i q ( μ A ν ν A μ ) = i q F μ ν (44.23) D μ , D ν = i q μ A ν ν A μ = i q F μ ν {:(44.23)[D_(mu),D_(nu)]=iq(del_(mu)A_(nu)-del_(nu)A_(mu))=iqF_(mu nu):}\begin{equation*} \left[D_{\mu}, D_{\nu}\right]=\mathrm{i} q\left(\partial_{\mu} A_{\nu}-\partial_{\nu} A_{\mu}\right)=\mathrm{i} q F_{\mu \nu} \tag{44.23} \end{equation*}(44.23)[Dμ,Dν]=iq(μAννAμ)=iqFμν
where F μ ν F μ ν F_(mu nu)F_{\mu \nu}Fμν are the components of the Faraday tensor F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~.
We see from the last example that the commutator of the covariant derivative gives us access to the tensor F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~ that tells us about the strength and dynamics of the gauge fields A ~ A ~ tilde(A)\tilde{\boldsymbol{A}}A~. In the fibre-bundle language, we can say that the [ D μ , D ν ] D μ , D ν [D_(mu),D_(nu)]\left[D_{\mu}, D_{\nu}\right][Dμ,Dν] is a curvature operator that tells us about the local strain in the bundle. This local strain is represented by the components of the curvature tensor F F F\boldsymbol{F}F.
What about the version for curved spacetime, rather than internal space? We saw in Chapter 35 that, when working in a coordinate system, parallel transport of a vector Z Z Z\boldsymbol{Z}Z around an infinitesimal loop also resulted in the action of the commutator of the covariant derivative μ μ grad_(mu)\boldsymbol{\nabla}_{\mu}μ. In direct analogy, we had the coordinate-frame expression
(44.24) [ μ , ν ] Z ρ = R σ μ ν ρ Z σ . (44.24) μ , ν Z ρ = R σ μ ν ρ Z σ . {:(44.24)[grad_(mu),grad_(nu)]Z^(rho)=R_(sigma mu nu)^(rho)Z^(sigma).:}\begin{equation*} \left[\nabla_{\mu}, \nabla_{\nu}\right] Z^{\rho}=R_{\sigma \mu \nu}^{\rho} Z^{\sigma} . \tag{44.24} \end{equation*}(44.24)[μ,ν]Zρ=RσμνρZσ.
It is clear that the Riemann tensor R R R\boldsymbol{R}R is analogous to the Faraday tensor F F F\boldsymbol{F}F in that it gives access to the local curvature. The difference is that the field strength R R R\boldsymbol{R}R describes the curvature of spacetime, while the field strength i q F i q F iqF\mathrm{i} q \boldsymbol{F}iqF describes the curvature of the fibre bundle containing the internal variables. We summarize the analogy in the table below, where we also link in the terminology of the fibre bundle picture.
General relativity Gauge theory Fibre bundles
coordinate transformation local phase transformation
strained bundle
connection coefficient Γ μ α β Γ μ α β Gamma^(mu)_(alpha beta)\Gamma^{\mu}{ }_{\alpha \beta}Γμαβ
strained bundle connection coefficient Gamma^(mu)_(alpha beta)| strained bundle | | :---: | | connection coefficient $\Gamma^{\mu}{ }_{\alpha \beta}$ |
gange field A μ A μ A_(mu)A_{\mu}Aμ
curvature R μ ν α β R μ ν α β R^(mu)_(nu alpha beta)R^{\mu}{ }_{\nu \alpha \beta}Rμναβ field strength iq q F μ ν q F μ ν qF_(mu nu)q F_{\mu \nu}qFμν bundle curvature
General relativity Gauge theory Fibre bundles coordinate transformation local phase transformation "strained bundle connection coefficient Gamma^(mu)_(alpha beta)" gange field A_(mu) curvature R^(mu)_(nu alpha beta) field strength iq qF_(mu nu) bundle curvature| General relativity | Gauge theory | Fibre bundles | | :---: | :---: | :---: | | coordinate transformation | local phase transformation | strained bundle <br> connection coefficient $\Gamma^{\mu}{ }_{\alpha \beta}$ | | gange field $A_{\mu}$ | | | | curvature $R^{\mu}{ }_{\nu \alpha \beta}$ | field strength iq $q F_{\mu \nu}$ | bundle curvature |
In this chapter, we have shown one example of a gauge theory: the complex scalar field theory [also known as U ( 1 ) U ( 1 ) U(1)U(1)U(1) theory]. In this case, the gauge field A ~ A ~ tilde(A)\tilde{\boldsymbol{A}}A~ is exactly the gauge field that describes electromagnetism. There are many more examples of gauge field theories, of which the complex scalar field theory is the simplest. They all share the structure presented here with the field strength being provided by the commutator of the covariant derivative. In later chapters, we shall encounter gauges again. Most dramatically, we will see in Chapter 48 an attempt to add a dimension into spacetime in order to accommodate an internal parameter.

Chapter summary

  • Internal variables carried by fields can be thought of as living in a fibre, suspended above a point in spacetime. The fibres are combined to make a fibre bundle.
  • A field can be made invariant with respect to local changes of phase by introducing a gauge field A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x). This leads to a covariant derivative D μ = μ + i q A μ D μ = μ + i q A μ D_(mu)=del_(mu)+iqA_(mu)D_{\mu}=\partial_{\mu}+\mathrm{i} q A_{\mu}Dμ=μ+iqAμ.
  • Parallel transport of a field around a loop results in a contribution to the field proportional the action of the linear operator [ D μ , D ν ] = i q F μ ν D μ , D ν = i q F μ ν [D_(mu),D_(nu)]=iqF_(mu nu)\left[D_{\mu}, D_{\nu}\right]=\mathrm{i} q F_{\mu \nu}[Dμ,Dν]=iqFμν on the field. This is analogous to the case of gravitation, where the Riemann tensor R R R\boldsymbol{R}R takes the place of F ~ F ~ tilde(F)\tilde{\boldsymbol{F}}F~.

Exercises

(44.1) We saw in Chapter 43 how the Bianchi identity followed from the fact that the sum of curvatureinduced rotations associated with the six faces of an elementary cube is zero. Following the approach of Ryder (1985), we can repeat this for a gauge field with field strength [ D μ , D ν ] = i g G μ ν D μ , D ν = i g G μ ν [D_(mu),D_(nu)]=-igG_(mu nu)\left[D_{\mu}, D_{\nu}\right]=-\mathrm{i} g G_{\mu \nu}[Dμ,Dν]=igGμν.
Fig. 44.5 A path around the faces of a cube.
Referring to Fig: 44.5, the path around the cube is
( A B C D A P S R Q P A ) ( A B C D A P S R Q P A ) (ABCDAPSRQPA)(\mathcal{A B C D} \mathcal{A P S R \mathcal { Q P A } )}(ABCDAPSRQPA)
+ ( A D S P A B Q R C B A ) + ( A D S P A B Q R C B A ) +(ADSPABQRCBA)+(\mathcal{A D S P A B Q R C B A})+(ADSPABQRCBA)
+ ( A P Q B A D C R S A ) + ( A P Q B A D C R S A ) +(APQBADCRSA)+(\mathcal{A P Q B A D C R S \mathcal { A } )}+(APQBADCRSA)
We will evaluate the effect of traversing the first of these paths.
(a) Find the effect on a field ψ ψ psi\psiψ of traversing the path A B C D A A B C D A ABCDA\mathcal{A B C D} \mathcal{A}ABCDA.
(b) Next, compute the effect of traversing the path A P A P AP\mathcal{A P}AP.
(c) Now the circuit P S R Q P P S R Q P PSRQP\mathcal{P S R} \mathcal{Q P}PSRQP.
(d) Then the return path P A P A PA\mathcal{P} \mathcal{A}PA.
(e) Finally, combine all of the paths together to
obtain the result to leading order in the interaction strength g g ggg.
(f) Show that if the procedure is repeated around each of the three paths, the result is that the field changes by a factor
1 i g Δ V ρ μ ν ( D ρ G μ ν + D μ G ν ρ + D ν G ρ μ ) , ( 44.26 ) 1 i g Δ V ρ μ ν D ρ G μ ν + D μ G ν ρ + D ν G ρ μ , ( 44.26 ) 1-ig DeltaV^(rho mu nu)(D_(rho)G_(mu nu)+D_(mu)G_(nu rho)+D_(nu)G_(rho mu)),(44.26)1-\mathrm{i} g \Delta V^{\rho \mu \nu}\left(D_{\rho} G_{\mu \nu}+D_{\mu} G_{\nu \rho}+D_{\nu} G_{\rho \mu}\right),(44.26)1igΔVρμν(DρGμν+DμGνρ+DνGρμ),(44.26)
where V ρ μ ν V ρ μ ν V^(rho mu nu)V^{\rho \mu \nu}Vρμν is the volume of the cube.
(g) Show that the previous expression is consistent with the Jacobi identity
[ D ρ , [ D μ , D ν ] ] + [ D μ , [ D ν , D ρ ] ] + [ D ν , [ D ρ , D μ ] ] = 0 D ρ , D μ , D ν + D μ , D ν , D ρ + D ν , D ρ , D μ = 0 [D_(rho),[D_(mu),D_(nu)]]+[D_(mu),[D_(nu),D_(rho)]]+[D_(nu),[D_(rho),D_(mu)]]=0\left[D_{\rho},\left[D_{\mu}, D_{\nu}\right]\right]+\left[D_{\mu},\left[D_{\nu}, D_{\rho}\right]\right]+\left[D_{\nu},\left[D_{\rho}, D_{\mu}\right]\right]=0[Dρ,[Dμ,Dν]]+[Dμ,[Dν,Dρ]]+[Dν,[Dρ,Dμ]]=0
(44.2) Compute the energy-momentum tensor for complex scalar field theory in flat space.

Weak gravitational fields

We must touch his weaknesses with a delicate hand. There are some faults so nearly allied to excellence, that we scarce weed out the fault without eradicating the virtue.
Oliver Goldsmith (1728-1774) The Good-Natured Man
We have previously met applications of gravitation in circumstances in which gravity is at its strongest. This is exemplified by the physics of the black hole, where the extreme gravitational effects lead to a singularity. In this chapter, we discuss the opposite limit: weak gravitational fields. The subject is particularly important since this is the limit we experience most often in our interactions with Nature. Here we examine how best to treat gravitation in the limit that its effects are weak, and we shall build an approximate field theory that is notable in that it hosts wave-like excitations. As a warm up, we start by examining how the Einstein equations map onto Newton's theory of gravity in the limit of weak fields. 1 1 ^(1){ }^{1}1

45.1 The Newtonian limit

The Newtonian limit is the limit of velocities which are small compared to c c ccc (as measured by a particular 2 2 ^(2){ }^{2}2 observer). This corresponds to a velocity vector u u u\boldsymbol{u}u, with components ( u 0 , u 1 , u 2 , u 3 ) u 0 , u 1 , u 2 , u 3 (u^(0),u^(1),u^(2),u^(3))\left(u^{0}, u^{1}, u^{2}, u^{3}\right)(u0,u1,u2,u3), for which the timelike component u 0 u 0 u^(0)u^{0}u0 much larger than the spacelike components u i u i u^(i)u^{i}ui. In other words, slow-moving objects have velocities with components written approximately as ( 1 , u ) ( 1 , u ) (1, vec(u))(1, \vec{u})(1,u) with 3 u i 1 3 u i 1 ^(3)u^(i)≪1^{3} u^{i} \ll 13ui1. In order that gravitational effects do not cause masses to attain high velocities, this limit also implies that the gravitational field is limited in strength. Since it is mass-energy that causes spacetime to curve, a limitation of the strength of gravitational curvature constrains the energy-momentum tensor. Specifically, we demand that in the Newtonian limit, all stresses T i j T i j T^(ij)T^{i j}Tij must be small compared to the density of mass energy T 00 T 00 T^(00)T^{00}T00, so that 4 4 ^(4){ }^{4}4
(45.2) | T i j | T 00 1 (45.2) T i j T 00 1 {:(45.2)(|T^(ij)|)/(T^(00))≪1:}\begin{equation*} \frac{\left|T^{i j}\right|}{T^{00}} \ll 1 \tag{45.2} \end{equation*}(45.2)|Tij|T001
This provides a working definition of what we mean by weak gravitation. In the weak-field limit, spacetime is almost flat. Perfectly flat spacetime is described by the Minkowski metric η η eta\boldsymbol{\eta}η and so the weak field causes a small perturbation to this underlying geometry. The deviation from the near-flatness is encoded in a small additive correction to

45.1 The Newtonian limit 485 45.2 Linearized theory of gravitation 487 45.3 Exploiting gauges 488 Chapter summary 491 Exercises 492
1 1 ^(1){ }^{1}1 Some of the arguments presented in this chapter were trailed back in Section 14.1.
2 2 ^(2){ }^{2}2 Not all observers will agree that a particular velocity is in the Newtonian limit.
3 3 ^(3){ }^{3}3 That is to say, of course, that u i u i u^(i)≪u^{i} \llui c. Our choice of units obscures the identification of when relativistic effects should be expected to be large. The relevant factor for velocities is v 2 / c 2 v 2 / c 2 v^(2)//c^(2)v^{2} / c^{2}v2/c2, which approaches unity in the relativiswhich approaches unity in the relativis-
tic limit. Using some dimensional analysis we can replace v 2 v 2 v^(2)v^{2}v2 with G M / r G M / r GM//rG M / rGM/r, which has the same dimensions, and so we conclude that when the factor
(45.1) ε = G M c 2 r (45.1) ε = G M c 2 r {:(45.1)epsi=(GM)/(c^(2)r):}\begin{equation*} \varepsilon=\frac{G M}{c^{2} r} \tag{45.1} \end{equation*}(45.1)ε=GMc2r
is of order 1, relativistic effects are important. When ε 1 ε 1 epsi≪1\varepsilon \ll 1ε1, we are in the Newtonian limit.
4 4 ^(4){ }^{4}4 See Section 14.1 for further justification of this point.
5 5 ^(5){ }^{5}5 This allows gravitation to be treated in the same way as the other classi cal field theories, such as electromagnetism, using the tools from Chapter 40 .
the Minkowski metric, such that the components of the full metric g g g\boldsymbol{g}g are written as
(45.3) g α β = η α β + h α β (45.3) g α β = η α β + h α β {:(45.3)g_(alpha beta)=eta_(alpha beta)+h_(alpha beta):}\begin{equation*} g_{\alpha \beta}=\eta_{\alpha \beta}+h_{\alpha \beta} \tag{45.3} \end{equation*}(45.3)gαβ=ηαβ+hαβ
with | h α β | 1 h α β 1 |h_(alpha beta)|≪1\left|h_{\alpha \beta}\right| \ll 1|hαβ|1. The small additional components h α β h α β h_(alpha beta)h_{\alpha \beta}hαβ will act as a symmetric tensor field. To see this, consider a Lorentz transformation of coordinates x μ = Λ μ ν x ν x μ = Λ μ ν x ν x^(mu)=Lambda^(mu)_(nu^('))x^(nu^('))x^{\mu}=\Lambda^{\mu}{ }_{\nu^{\prime}} x^{\nu^{\prime}}xμ=Λμνxν. Equation 45.3 is altered when we try to transform the components of g g g\boldsymbol{g}g from a frame with primed indices into a frame with unprimed indices using the Lorentz transformations. We have
g α β = η α β + h α β = Λ α μ Λ β ν ( η μ ν + h μ ν ) (45.4) = η α β + Λ α μ Λ β ν h μ ν g α β = η α β + h α β = Λ α μ Λ β ν η μ ν + h μ ν (45.4) = η α β + Λ α μ Λ β ν h μ ν {:[g_(alpha^(')beta^('))=eta_(alpha^(')beta^('))+h_(alpha^(')beta^('))=Lambda_(alpha^('))^(mu)Lambda_(beta^('))^(nu)(eta_(mu nu)+h_(mu nu))],[(45.4)=eta_(alpha^(')beta^('))+Lambda_(alpha^('))^(mu)Lambda_(beta^('))^(nu)h_(mu nu)]:}\begin{align*} g_{\alpha^{\prime} \beta^{\prime}}=\eta_{\alpha^{\prime} \beta^{\prime}}+h_{\alpha^{\prime} \beta^{\prime}} & =\Lambda_{\alpha^{\prime}}^{\mu} \Lambda_{\beta^{\prime}}^{\nu}\left(\eta_{\mu \nu}+h_{\mu \nu}\right) \\ & =\eta_{\alpha^{\prime} \beta^{\prime}}+\Lambda_{\alpha^{\prime}}^{\mu} \Lambda_{\beta^{\prime}}^{\nu} h_{\mu \nu} \tag{45.4} \end{align*}gαβ=ηαβ+hαβ=ΛαμΛβν(ημν+hμν)(45.4)=ηαβ+ΛαμΛβνhμν
We conclude that the corrections h μ ν h μ ν h_(mu nu)h_{\mu \nu}hμν transform like the components of a tensor in flat spacetime with
(45.5) h α β = Λ α μ Λ β ν h μ ν (45.5) h α β = Λ α μ Λ β ν h μ ν {:(45.5)h_(alpha^(')beta^('))=Lambda_(alpha^('))^(mu)Lambda_(beta^('))^(nu)h_(mu nu):}\begin{equation*} h_{\alpha^{\prime} \beta^{\prime}}=\Lambda_{\alpha^{\prime}}^{\mu} \Lambda_{\beta^{\prime}}^{\nu} h_{\mu \nu} \tag{45.5} \end{equation*}(45.5)hαβ=ΛαμΛβνhμν
It is convenient, therefore, to conceptualize weak gravitational fields as existing in a flat spacetime with Minkowski metric η η eta\boldsymbol{\eta}η, but filled with a symmetric tensor field 5 h ( x ) 5 h ( x ) ^(5)h(x){ }^{5} \boldsymbol{h}(x)5h(x).
It should be stressed though that this picture is a fiction! It is not possible within general relativity to exactly describe a curved spacetime as flat Minkowski space with an additional field defined on it, hence the reason for Einstein's geometrical theory in the first place. So we should keep in mind that the spacetime is actually curved and it is only an approximation to treat it as flat. Indeed, h h h\boldsymbol{h}h only looks like a tensor when we assume flat spacetime, and if we were to be forced to consider coordinate transformations other than the Lorentz transformations, then h h h\boldsymbol{h}h would not necessarily transform as a tensor. Nonetheless, this picture of a flat spacetime with a field h ( x ) h ( x ) h(x)\boldsymbol{h}(x)h(x) overlaid is a very useful one that allows us to compute the effects of gravitation using h ( x ) h ( x ) h(x)\boldsymbol{h}(x)h(x), while raising and lowering indices using the components of the Minkowski metric. In addition, in order to match the boundary condition that the gravitation field dies away at infinity [i.e. the potential Φ ( r ) = 0 Φ ( r ) = 0 Phi(r rarr oo)=0\Phi(r \rightarrow \infty)=0Φ(r)=0 ], we will also demand that h ( r ) = 0 h ( r ) = 0 h(r rarr oo)=0h(r \rightarrow \infty)=0h(r)=0.
We have said several times how unique gravitation is in its treatment of the metric field. The weak-field approximation has the great advantage that it gives us an approximate theory that closely resembles all other classical theories of fields (in flat spacetime) in which classical fields fill all of spacetime. So now, instead of gravitation determining the metric field, gravitation is described by the field h h h\boldsymbol{h}h in flat spacetime, and it is the dynamics of this h h h\boldsymbol{h}h-field that we seek to describe. This is a really useful simplification. The next step in our argument is to find a linear version of the Einstein equation that involves the h h h\boldsymbol{h}h-field, rather than the Einstein tensor G G G\boldsymbol{G}G, and this linearized approximation to general relativity, which is valid in the weak-field limit, will also set us up for exploring the properties of gravitational waves.

45.2 Linearized theory of gravitation

General relativity is a nonlinear field theory. This may be traced back to the products of the connection coefficients that feature in the expression for the Riemann tensor. 6 6 ^(6){ }^{6}6 In the limit of small interaction strength, many nonlinear field theories can be linearized by dropping nonlinear terms rendered small by the small size of the interaction strength. This is what we shall do here.
In the last section, we saw that the components of the metric may be written as g α β = η α β + h α β g α β = η α β + h α β g_(alpha beta)=eta_(alpha beta)+h_(alpha beta)g_{\alpha \beta}=\eta_{\alpha \beta}+h_{\alpha \beta}gαβ=ηαβ+hαβ, where we agree to raise and lower indices with η μ ν η μ ν eta^(mu nu)\eta^{\mu \nu}ημν and η μ ν η μ ν eta_(mu nu)\eta_{\mu \nu}ημν, respectively. With this in mind, the connection coefficients are given by
Γ α β μ = 1 2 η μ ν ( h α ν , β + h β ν , α h α β , ν ) (45.6) = 1 2 ( h α , β μ + h β μ , α h α β , μ ) Γ α β μ = 1 2 η μ ν h α ν , β + h β ν , α h α β , ν (45.6) = 1 2 h α , β μ + h β μ , α h α β , μ {:[Gamma_(alpha beta)^(mu)=(1)/(2)*eta^(mu nu)(h_(alpha nu,beta)+h_(beta nu,alpha)-h_(alpha beta,nu))],[(45.6)=(1)/(2)*(h_(alpha^('),beta)^(mu)+(h_(beta)^(mu),alpha)-h_(alpha beta)^(,mu))]:}\begin{align*} \Gamma_{\alpha \beta}^{\mu} & =\frac{1}{2} \cdot \eta^{\mu \nu}\left(h_{\alpha \nu, \beta}+h_{\beta \nu, \alpha}-h_{\alpha \beta, \nu}\right) \\ & =\frac{1}{2} \cdot\left(h_{\alpha^{\prime}, \beta}^{\mu}+{h_{\beta}{ }^{\mu}, \alpha}-h_{\alpha \beta}{ }^{, \mu}\right) \tag{45.6} \end{align*}Γαβμ=12ημν(hαν,β+hβν,αhαβ,ν)(45.6)=12(hα,βμ+hβμ,αhαβ,μ)
where we have extended the comma notation to take in up indices as well as down ones. We'll use this expression for the connections to compute the components of the Riemann tensor. However, since we are seeking a linearized theory, we ignore terms that are products of the Γ Γ Gamma\GammaΓ s, leaving us with a simplified expression for the components of R R R\boldsymbol{R}R that reads
(45.7) R α μ β ν η α λ ( Γ μ β , ν λ Γ β μ , ν λ ) (45.7) R α μ β ν η α λ Γ μ β , ν λ Γ β μ , ν λ {:(45.7)R_(alpha mu beta nu)~~eta_(alpha lambda)(Gamma_(mu beta,nu)^(lambda)-Gamma_(beta mu,nu)^(lambda)):}\begin{equation*} R_{\alpha \mu \beta \nu} \approx \eta_{\alpha \lambda}\left(\Gamma_{\mu \beta, \nu}^{\lambda}-\Gamma_{\beta \mu, \nu}^{\lambda}\right) \tag{45.7} \end{equation*}(45.7)Rαμβνηαλ(Γμβ,νλΓβμ,νλ)

Example 45.1

In terms of the metric components, we obtain (after some reordering)
(45.8) R α μ β ν = 1 2 ( h μ β , ν α + h ν α , μ β h μ ν , α β h α β , μ ν ) . (45.8) R α μ β ν = 1 2 h μ β , ν α + h ν α , μ β h μ ν , α β h α β , μ ν . {:(45.8)R_(alpha mu beta nu)=(1)/(2)(h_(mu beta,nu alpha)+h_(nu alpha,mu beta)-h_(mu nu,alpha beta)-h_(alpha beta,mu nu)).:}\begin{equation*} R_{\alpha \mu \beta \nu}=\frac{1}{2}\left(h_{\mu \beta, \nu \alpha}+h_{\nu \alpha, \mu \beta}-h_{\mu \nu, \alpha \beta}-h_{\alpha \beta, \mu \nu}\right) . \tag{45.8} \end{equation*}(45.8)Rαμβν=12(hμβ,να+hνα,μβhμν,αβhαβ,μν).
At the same level of approximation, the Ricci tensor is given by
R μ ν = Γ α ν μ , α Γ α α μ , ν (45.9) = 1 2 ( h μ α , ν α + h ν α , μ α h μ ν , α α h , μ ν ) , . . R μ ν = Γ α ν μ , α Γ α α μ , ν (45.9) = 1 2 h μ α , ν α + h ν α , μ α h μ ν , α α h , μ ν , {:[R_(mu nu)=Gamma^(alpha)_(nu mu,alpha)-Gamma^(alpha)_(alpha mu,nu)],[(45.9)=(1)/(2)(h_(mu)^(alpha)_(,nu alpha)+{:h_(nu)^(alpha)_(,mu alpha)-h_(mu nu,alpha)^(alpha)-h_(,mu nu)),^(". "):}". "]:}\begin{align*} R_{\mu \nu} & =\Gamma^{\alpha}{ }_{\nu \mu, \alpha}-\Gamma^{\alpha}{ }_{\alpha \mu, \nu} \\ & =\frac{1}{2}\left(h_{\mu}{ }^{\alpha}{ }_{, \nu \alpha}+{\left.h_{\nu}{ }^{\alpha}{ }_{, \mu \alpha}-h_{\mu \nu, \alpha}{ }^{\alpha}-h_{, \mu \nu}\right),}^{\text {. }}\right. \text {. } \tag{45.9} \end{align*}Rμν=Γανμ,αΓααμ,ν(45.9)=12(hμα,να+hνα,μαhμν,ααh,μν),
where, in the final term we have defined the trace h = h α α = η α β h α β h = h α α = η α β h α β h=h^(alpha)_(alpha)=eta^(alpha beta)h_(alpha beta)h=h^{\alpha}{ }_{\alpha}=\eta^{\alpha \beta} h_{\alpha \beta}h=hαα=ηαβhαβ.
The tools from the last example allow us to work out the Einstein equation G μ ν = R μ ν 1 2 g μ ν R = 8 π T μ ν G μ ν = R μ ν 1 2 g μ ν R = 8 π T μ ν G_(mu nu)=R_(mu nu)-(1)/(2)g_(mu nu)R=8piT_(mu nu)G_{\mu \nu}=R_{\mu \nu}-\frac{1}{2} g_{\mu \nu} R=8 \pi T_{\mu \nu}Gμν=Rμν12gμνR=8πTμν. We obtain the rather unmemorable expression
(45.10) h μ α , ν α + h ν α , μ α h μ ν , α α h , μ ν η μ ν ( h α β , α β h , β β ) = 16 π T μ ν (45.10) h μ α , ν α + h ν α , μ α h μ ν , α α h , μ ν η μ ν h α β , α β h , β β = 16 π T μ ν {:(45.10)h_(mu alpha,nu)^(alpha)+h_(nu alpha,mu)^(alpha)-h_(mu nu,alpha)^(alpha)-h_(,mu nu)-eta_(mu nu)(h_(alpha beta)^(,alpha beta)-h_(,beta)^(beta))=16 piT_(mu nu):}\begin{equation*} h_{\mu \alpha, \nu}^{\alpha}+h_{\nu \alpha, \mu}^{\alpha}-h_{\mu \nu, \alpha}^{\alpha}-h_{, \mu \nu}-\eta_{\mu \nu}\left(h_{\alpha \beta}^{, \alpha \beta}-h_{, \beta}^{\beta}\right)=16 \pi T_{\mu \nu} \tag{45.10} \end{equation*}(45.10)hμα,να+hνα,μαhμν,ααh,μνημν(hαβ,αβh,ββ)=16πTμν
To simplify this, we define the bar notation for trace-reversed 7 7 ^(7){ }^{7}7 quantities
(45.13) h ¯ μ ν = h μ ν 1 2 η μ ν h (45.13) h ¯ μ ν = h μ ν 1 2 η μ ν h {:(45.13) bar(h)_(mu nu)=h_(mu nu)-(1)/(2)eta_(mu nu)h:}\begin{equation*} \bar{h}_{\mu \nu}=h_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} h \tag{45.13} \end{equation*}(45.13)h¯μν=hμν12ημνh
6 6 ^(6){ }^{6}6 Recall that
R α β μ ν = Γ α ν β , μ Γ α μ β , ν R α β μ ν = Γ α ν β , μ Γ α μ β , ν R^(alpha)_(beta mu nu)=Gamma^(alpha)_(nu beta,mu)-Gamma^(alpha)_(mu beta,nu)R^{\alpha}{ }_{\beta \mu \nu}=\Gamma^{\alpha}{ }_{\nu \beta, \mu}-\Gamma^{\alpha}{ }_{\mu \beta, \nu}Rαβμν=Γανβ,μΓαμβ,ν
+ Γ μ σ α Γ ν β σ Γ ν σ α Γ σ μ β . + Γ μ σ α Γ ν β σ Γ ν σ α Γ σ μ β . +Gamma_(mu sigma)^(alpha)Gamma_(nu beta)^(sigma)-Gamma_(nu sigma)^(alpha)Gamma^(sigma)_(mu beta).+\Gamma_{\mu \sigma}^{\alpha} \Gamma_{\nu \beta}^{\sigma}-\Gamma_{\nu \sigma}^{\alpha} \Gamma^{\sigma}{ }_{\mu \beta} .+ΓμσαΓνβσΓνσαΓσμβ.
Terms three and four on the right-hand side lead to nonlinearity.
7 7 ^(7){ }^{7}7 Recall that the name 'trace-reversed' comes from the fact that
(45.11) h ¯ = h ¯ α α = h (45.11) h ¯ = h ¯ α α = h {:(45.11) bar(h)= bar(h)_(alpha)^(alpha)=-h:}\begin{equation*} \bar{h}=\bar{h}_{\alpha}^{\alpha}=-h \tag{45.11} \end{equation*}(45.11)h¯=h¯αα=h
This means that G μ ν = R ¯ μ ν G μ ν = R ¯ μ ν G_(mu nu)= bar(R)_(mu nu)G_{\mu \nu}=\bar{R}_{\mu \nu}Gμν=R¯μν. Note that h ¯ μ ν = h μ ν h ¯ ¯ μ ν = h μ ν bar(bar(h))_(mu nu)=h_(mu nu)\overline{\bar{h}}_{\mu \nu}=h_{\mu \nu}h¯μν=hμν, so that
(45.12) h μ ν = h ¯ μ ν 1 2 η μ ν h ¯ (45.12) h μ ν = h ¯ μ ν 1 2 η μ ν h ¯ {:(45.12)h_(mu nu)= bar(h)_(mu nu)-(1)/(2)eta_(mu nu) bar(h):}\begin{equation*} h_{\mu \nu}=\bar{h}_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} \bar{h} \tag{45.12} \end{equation*}(45.12)hμν=h¯μν12ημνh¯
8 8 ^(8){ }^{8}8 Recall the situation in electromag netism where we are free to pick the components of the electromagnetic field 1-form A ~ ( x ) A ~ ( x ) tilde(A)(x)\tilde{\boldsymbol{A}}(x)A~(x). Specifically, we can always make the transformation A μ A μ A_(mu)rarrA_{\mu} \rightarrowAμ A μ χ , μ A μ χ , μ A_(mu)-chi,muA_{\mu}-\chi, \muAμχ,μ without making any changes to Maxwell's equations. We can therefore pick the function χ χ chi\chiχ in such a way as to simplify the problem at hand.
9 9 ^(9){ }^{9}9 Unlike the transformation in Chapter 44: θ ( x ) θ ( x ) + α ( x ) θ ( x ) θ ( x ) + α ( x ) theta(x)rarr theta(x)+alpha(x)\theta(x) \rightarrow \theta(x)+\alpha(x)θ(x)θ(x)+α(x), where we were changing an internal variable at each point in spacetime, we must label the underlying points in the manifold with labels like P P P\mathcal{P}P, since the spacetime coordinates themselves are being trans formed. The choice of length scale in the coordinates implied by the choice of ξ μ ( P ) ξ μ ( P ) xi^(mu)(P)\xi^{\mu}(\mathcal{P})ξμ(P) in eqn 45.17 was called a 'gauge by Hermann Weyl, where the use of the term is motivated by the 'gauge' of wires and rails.
Using the trace-reversed fields we find the Einstein tensor becomes
(45.14) G μ ν = 1 2 [ h ¯ μ ν , α α + η μ ν h ¯ α β , α β h ¯ μ α , ν α h ¯ ν α , μ α ] (45.14) G μ ν = 1 2 h ¯ μ ν , α α + η μ ν h ¯ α β , α β h ¯ μ α , ν α h ¯ ν α , μ α {:(45.14)G_(mu nu)=-(1)/(2)[ bar(h)_(mu nu,alpha)^(alpha)+eta_(mu nu) bar(h)_(alpha beta,)^(alpha beta)- bar(h)_(mu alpha,_(nu))^(alpha)- bar(h)_(nu alpha,_(mu))^(alpha)]:}\begin{equation*} G_{\mu \nu}=-\frac{1}{2}\left[\bar{h}_{\mu \nu, \alpha}^{\alpha}+\eta_{\mu \nu} \bar{h}_{\alpha \beta,}^{\alpha \beta}-\bar{h}_{\mu \alpha,{ }_{\nu}}^{\alpha}-\bar{h}_{\nu \alpha,{ }_{\mu}}^{\alpha}\right] \tag{45.14} \end{equation*}(45.14)Gμν=12[h¯μν,αα+ημνh¯αβ,αβh¯μα,ναh¯να,μα]
so that we have another formidable-looking expression
(45.15) h ¯ μ ν , α α η μ ν h ¯ α β , α β + h ¯ μ α , ν α + h ¯ ν α , μ α = 16 π T μ ν (45.15) h ¯ μ ν , α α η μ ν h ¯ α β , α β + h ¯ μ α , ν α + h ¯ ν α , μ α = 16 π T μ ν {:(45.15)- bar(h)_(mu nu,alpha)^(alpha)-eta_(mu nu) bar(h)_(alpha beta,)^(alpha beta)+ bar(h)_(mu alpha,_(nu))^(alpha)+ bar(h)_(nu alpha,_(mu))^(alpha)=16 piT_(mu nu):}\begin{equation*} -\bar{h}_{\mu \nu, \alpha}^{\alpha}-\eta_{\mu \nu} \bar{h}_{\alpha \beta,}{ }^{\alpha \beta}+\bar{h}_{\mu \alpha,{ }_{\nu}}^{\alpha}+\bar{h}_{\nu \alpha,{ }_{\mu}}^{\alpha}=16 \pi T_{\mu \nu} \tag{45.15} \end{equation*}(45.15)h¯μν,ααημνh¯αβ,αβ+h¯μα,να+h¯να,μα=16πTμν
We have succeeded in writing down a linearized version of the Einstein equation in terms of the h h h\boldsymbol{h}h-field. It has the disadvantage of looking rather complicated. However, a drastic simplification is possible, by exploiting the notion of gauge freedom discussed in Chapters 42 and 44.

45.3 Exploiting gauges

In simplifying the Einstein equation, it would be very helpful to us if we were allowed to write
(45.16) h ¯ , α μ α = 0 (45.16) h ¯ , α μ α = 0 {:(45.16) bar(h)_(,alpha)^(mu alpha)=0:}\begin{equation*} \bar{h}_{, \alpha}^{\mu \alpha}=0 \tag{45.16} \end{equation*}(45.16)h¯,αμα=0
since this would kill off several terms in eqn 45.15. It turns out that this is a condition we can indeed fix. The reason for this stems from the gauge invariance properties of the Einstein equation. 8 8 ^(8){ }^{8}8
In order to see the gauge transformation properties, we allow the coordinates to transform via an infinitesimal coordinate transformation 9 9 ^(9){ }^{9}9
(45.17) x μ ( P ) = x μ ( P ) + ξ μ ( P ) (45.17) x μ ( P ) = x μ ( P ) + ξ μ ( P ) {:(45.17)x^(mu^('))(P)=x^(mu)(P)+xi^(mu)(P):}\begin{equation*} x^{\mu^{\prime}}(\mathcal{P})=x^{\mu}(\mathcal{P})+\xi^{\mu}(\mathcal{P}) \tag{45.17} \end{equation*}(45.17)xμ(P)=xμ(P)+ξμ(P)
where P P P\mathcal{P}P is some point and the shifts ξ μ ( P ) ξ μ ( P ) xi^(mu)(P)\xi^{\mu}(\mathcal{P})ξμ(P) are small enough that the components of h h h\boldsymbol{h}h in the primed frame obey | h μ ν | 1 h μ ν 1 |h_(mu^(')nu^('))|≪1\left|h_{\mu^{\prime} \nu^{\prime}}\right| \ll 1|hμν|1. This tiny transformation is designed to make only tiny changes to the smoothly varying fields that exist in the almost-flat spacetime. However, there is one field where they cannot be ignored: the metric. This is because it is exactly the very small variations in the metric that describe gravitation in this limit. Recall that metric components have the transformation law
(45.18) g ρ σ [ x α ( P ) ] = g μ ν [ x α ( P ) ] x μ x ρ x ν x σ (45.18) g ρ σ x α ( P ) = g μ ν x α ( P ) x μ x ρ x ν x σ {:(45.18)g_(rho^(')sigma^('))[x^(alpha^('))(P)]=g_(mu nu)[x^(alpha)(P)](delx^(mu))/(delx^(rho^(')))*(delx^(nu))/(delx^(sigma^('))):}\begin{equation*} g_{\rho^{\prime} \sigma^{\prime}}\left[x^{\alpha^{\prime}}(\mathcal{P})\right]=g_{\mu \nu}\left[x^{\alpha}(\mathcal{P})\right] \frac{\partial x^{\mu}}{\partial x^{\rho^{\prime}}} \cdot \frac{\partial x^{\nu}}{\partial x^{\sigma^{\prime}}} \tag{45.18} \end{equation*}(45.18)gρσ[xα(P)]=gμν[xα(P)]xμxρxνxσ
Substitute from eqns 45.3 and 45.17 , and we see that
g ρ σ ( x α = a α ) = η ρ σ + h ρ σ ( x α = a α ) ξ ρ , σ ξ σ , ρ + g ρ σ x α = a α = η ρ σ + h ρ σ x α = a α ξ ρ , σ ξ σ , ρ + g_(rho^(')sigma^('))(x^(alpha^('))=a^(alpha))=eta_(rho^(')sigma^('))+h_(rho^(')sigma^('))(x^(alpha^('))=a^(alpha))-xi_(rho^('),sigma^('))-xi_(sigma^('),rho^('))+g_{\rho^{\prime} \sigma^{\prime}}\left(x^{\alpha^{\prime}}=a^{\alpha}\right)=\eta_{\rho^{\prime} \sigma^{\prime}}+h_{\rho^{\prime} \sigma^{\prime}}\left(x^{\alpha^{\prime}}=a^{\alpha}\right)-\xi_{\rho^{\prime}, \sigma^{\prime}}-\xi_{\sigma^{\prime}, \rho^{\prime}}+gρσ(xα=aα)=ηρσ+hρσ(xα=aα)ξρ,σξσ,ρ+ (corrections).
Example 45.2
Let's check this last expression. We have
(45.20) x μ = x μ ξ μ (45.20) x μ = x μ ξ μ {:(45.20)x^(mu)=x^(mu^('))-xi^(mu):}\begin{equation*} x^{\mu}=x^{\mu^{\prime}}-\xi^{\mu} \tag{45.20} \end{equation*}(45.20)xμ=xμξμ
and so
(45.21) x μ x ρ = δ ρ μ ξ , ρ μ (45.21) x μ x ρ = δ ρ μ ξ , ρ μ {:(45.21)(delx^(mu))/(delx^(rho^(')))=delta_(rho^('))^(mu^('))-xi_(,rho^('))^(mu):}\begin{equation*} \frac{\partial x^{\mu}}{\partial x^{\rho^{\prime}}}=\delta_{\rho^{\prime}}^{\mu^{\prime}}-\xi_{, \rho^{\prime}}^{\mu} \tag{45.21} \end{equation*}(45.21)xμxρ=δρμξ,ρμ
giving the combination
(45.22) x μ x ρ x ν x σ = δ ρ μ δ σ ν ξ , ρ μ δ σ ν ξ , σ ν δ ρ + (45.22) x μ x ρ x ν x σ = δ ρ μ δ σ ν ξ , ρ μ δ σ ν ξ , σ ν δ ρ + {:(45.22)(delx^(mu))/(delx^(rho^(')))*(delx^(nu))/(delx^(sigma^(')))=delta_(rho^('))^(mu)delta_(sigma^('))^(nu)-xi_(,rho^('))^(mu)delta_(sigma^('))^(nu)-xi_(,sigma^('))^(nu)delta_(rho^('))+dots:}\begin{equation*} \frac{\partial x^{\mu}}{\partial x^{\rho^{\prime}}} \cdot \frac{\partial x^{\nu}}{\partial x^{\sigma^{\prime}}}=\delta_{\rho^{\prime}}^{\mu} \delta_{\sigma^{\prime}}^{\nu}-\xi_{, \rho^{\prime}}^{\mu} \delta_{\sigma^{\prime}}^{\nu}-\xi_{, \sigma^{\prime}}^{\nu} \delta_{\rho^{\prime}}+\ldots \tag{45.22} \end{equation*}(45.22)xμxρxνxσ=δρμδσνξ,ρμδσνξ,σνδρ+
where we ignore higher order terms in the derivatives of the transformation components ξ μ ξ μ xi^(mu)\xi^{\mu}ξμ. We then have a right-hand side of eqn 45.18 given by
( η μ ν + h μ ν ) x μ x ρ x ν x σ = η ρ , σ + h ρ σ ξ , , ρ μ η μ σ ξ , σ ν η ρ ν + (45.23) = η ρ , σ + h ρ σ ξ σ ρ ξ ρ σ + η μ ν + h μ ν x μ x ρ x ν x σ = η ρ , σ + h ρ σ ξ , , ρ μ η μ σ ξ , σ ν η ρ ν + (45.23) = η ρ , σ + h ρ σ ξ σ ρ ξ ρ σ + {:[(eta_(mu nu)+h_(mu nu))(delx^(mu))/(delx^(rho^(')))(delx^(nu))/(delx^(sigma^(')))=eta_(rho^('),sigma^('))+h_(rho^(')sigma^('))-xi_(,,rho^('))^(mu)eta_(musigma^('))-xi_(,sigma^('))^(nu)eta_(rho^(')nu)+dots],[(45.23)=eta_(rho^('),sigma^('))+h_(rho^(')sigma^('))-xi_(sigma^(')rho^('))-xi_(rho^(')sigma^('))+dots]:}\begin{align*} \left(\eta_{\mu \nu}+h_{\mu \nu}\right) \frac{\partial x^{\mu}}{\partial x^{\rho^{\prime}}} \frac{\partial x^{\nu}}{\partial x^{\sigma^{\prime}}} & =\eta_{\rho^{\prime}, \sigma^{\prime}}+h_{\rho^{\prime} \sigma^{\prime}}-\xi_{,, \rho^{\prime}}^{\mu} \eta_{\mu \sigma^{\prime}}-\xi_{, \sigma^{\prime}}^{\nu} \eta_{\rho^{\prime} \nu}+\ldots \\ & =\eta_{\rho^{\prime}, \sigma^{\prime}}+h_{\rho^{\prime} \sigma^{\prime}}-\xi_{\sigma^{\prime} \rho^{\prime}}-\xi_{\rho^{\prime} \sigma^{\prime}}+\ldots \tag{45.23} \end{align*}(ημν+hμν)xμxρxνxσ=ηρ,σ+hρσξ,,ρμημσξ,σνηρν+(45.23)=ηρ,σ+hρσξσρξρσ+
where we've retained the lowest order terms only.
We conclude that the components of h h h\boldsymbol{h}h in new and old coordinates are 10 10 ^(10){ }^{10}10
(45.24) h μ ν new = h μ ν old ξ μ , ν ξ ν , μ (45.24) h μ ν new  = h μ ν old  ξ μ , ν ξ ν , μ {:(45.24)h_(mu nu)^("new ")=h_(mu nu)^("old ")-xi_(mu,nu)-xi_(nu,mu):}\begin{equation*} h_{\mu \nu}^{\text {new }}=h_{\mu \nu}^{\text {old }}-\xi_{\mu, \nu}-\xi_{\nu, \mu} \tag{45.24} \end{equation*}(45.24)hμνnew =hμνold ξμ,νξν,μ
All other fields in the linearized theory are not changed appreciably by our infinitesimal coordinate transformations. We sketch this property for the Riemann tensor in the next example.

Example 45.3

We need to show R μ ν α β new = R μ ν α β old R μ ν α β new  = R μ ν α β old  R_(mu nu alpha beta)^("new ")=R_(mu nu alpha beta)^("old ")R_{\mu \nu \alpha \beta}^{\text {new }}=R_{\mu \nu \alpha \beta}^{\text {old }}Rμναβnew =Rμναβold . Under the gauge transformation we have
(45.25) h μ ν new = h μ ν old 2 ξ ( μ , ν ) , (45.25) h μ ν new = h μ ν old  2 ξ ( μ , ν ) , {:(45.25)h_(mu nu)^(new)=h_(mu nu)^("old ")-2xi_((mu,nu))",":}\begin{equation*} h_{\mu \nu}^{\mathrm{new}}=h_{\mu \nu}^{\text {old }}-2 \xi_{(\mu, \nu)}, \tag{45.25} \end{equation*}(45.25)hμνnew=hμνold 2ξ(μ,ν),
where we've used the useful bracket notation for symmetrization. We find
(45.26) R α μ β ν new = R α μ β ν old ξ α [ , ν , β ] , μ + ξ μ [ , ν , β ] , α ξ β [ , μ , α , ] , ν + ξ ν [ , μ , α ] , β , (45.26) R α μ β ν new = R α μ β ν old ξ α [ , ν , β ] , μ + ξ μ [ , ν , β ] , α ξ β [ , μ , α , ] , ν + ξ ν [ , μ , α ] , β , {:(45.26)R_(alpha mu beta nu)^(new)=R_(alpha mu beta nu)^(old)-xi_(alpha[,nu,beta],mu)+xi_(mu[,nu,beta],alpha)-xi_(beta[,mu,alpha,],nu)+xi_(nu[,mu,alpha],beta)",":}\begin{equation*} R_{\alpha \mu \beta \nu}^{\mathrm{new}}=R_{\alpha \mu \beta \nu}^{\mathrm{old}}-\xi_{\alpha[, \nu, \beta], \mu}+\xi_{\mu[, \nu, \beta], \alpha}-\xi_{\beta[, \mu, \alpha,], \nu}+\xi_{\nu[, \mu, \alpha], \beta}, \tag{45.26} \end{equation*}(45.26)Rαμβνnew=Rαμβνoldξα[,ν,β],μ+ξμ[,ν,β],αξβ[,μ,α,],ν+ξν[,μ,α],β,
where here the bracket notation for antisymmetrization has been used. The added terms are all zero (since derivatives commute) and the Riemann tensor is left unchanged.
We are therefore free to make infinitesimal coordinate transformations without changing the physics. Using the analogy from Chapter 44, we note that our choice of coordinates in this context is the same as a choice of gauge. We therefore choose coordinates such that
(45.27) h ¯ , α μ α = 0 (45.27) h ¯ , α μ α = 0 {:(45.27) bar(h)_(,alpha)^(mu alpha)=0:}\begin{equation*} \bar{h}_{, \alpha}^{\mu \alpha}=0 \tag{45.27} \end{equation*}(45.27)h¯,αμα=0
This is analogous to Lorenz gauge in electromagnetism, which is the condition A μ , μ = 0 A μ , μ = 0 A^(mu)_(,mu)=0A^{\mu}{ }_{, \mu}=0Aμ,μ=0. This gauge cause the Einstein equation to reduce to the far friendlier expression
(45.28) h ¯ μ ν , α α = 16 π T μ ν (45.28) h ¯ μ ν , α α = 16 π T μ ν {:(45.28)- bar(h)_(mu nu,alpha)^(alpha)=16 piT_(mu nu):}\begin{equation*} -\bar{h}_{\mu \nu, \alpha}^{\alpha}=16 \pi T_{\mu \nu} \tag{45.28} \end{equation*}(45.28)h¯μν,αα=16πTμν
Written in another way, we have 11 11 ^(11){ }^{11}11
(45.29) 2 h ¯ μ ν = 16 π T μ ν (45.29) 2 h ¯ μ ν = 16 π T μ ν {:(45.29)-del^(2) bar(h)_(mu nu)=16 piT_(mu nu):}\begin{equation*} -\partial^{2} \bar{h}_{\mu \nu}=16 \pi T_{\mu \nu} \tag{45.29} \end{equation*}(45.29)2h¯μν=16πTμν
where h ¯ μ ν = h μ ν 1 2 η μ ν h h ¯ μ ν = h μ ν 1 2 η μ ν h bar(h)_(mu nu)=h_(mu nu)-(1)/(2)eta_(mu nu)h\bar{h}_{\mu \nu}=h_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} hh¯μν=hμν12ημνh.
We have therefore derived a linear field theory of gravitation. It has the form of a wave equation 12 12 ^(12){ }^{12}12 in the absence of the source term 16 π T μ ν 16 π T μ ν 16 piT_(mu nu)16 \pi T_{\mu \nu}16πTμν.
10 10 ^(10){ }^{10}10 We can compare the situation in gravitation, where a coordinate change x μ ( P ) x μ ( P ) + ξ μ ( P ) x μ ( P ) x μ ( P ) + ξ μ ( P ) x^(mu)(P)rarrx^(mu)(P)+xi^(mu)(P)x^{\mu}(\mathcal{P}) \rightarrow x^{\mu}(\mathcal{P})+\xi^{\mu}(\mathcal{P})xμ(P)xμ(P)+ξμ(P) mandates a change in the metric field components
h μ ν h μ ν ξ μ , ν ξ ν , μ , h μ ν h μ ν ξ μ , ν ξ ν , μ , h_(mu nu)rarrh_(mu nu)-xi_(mu,nu)-xi_(nu,mu),h_{\mu \nu} \rightarrow h_{\mu \nu}-\xi_{\mu, \nu}-\xi_{\nu, \mu},hμνhμνξμ,νξν,μ,
with that of electromagnetism, where a change in an internal variable θ ( P ) θ ( P ) theta(P)rarr\theta(\mathcal{P}) \rightarrowθ(P) θ ( P ) + α ( P ) θ ( P ) + α ( P ) theta(P)+alpha(P)\theta(\mathcal{P})+\alpha(\mathcal{P})θ(P)+α(P) mandates a change in the electromagnetic field components
A μ A μ 1 q α , μ A μ A μ 1 q α , μ A_(mu)rarrA_(mu)-(1)/(q)alpha_(,mu)A_{\mu} \rightarrow A_{\mu}-\frac{1}{q} \alpha_{, \mu}AμAμ1qα,μ
where q q qqq is the electromagnetic charge.
11 11 ^(11){ }^{11}11 We use the notation
2 = 2 t 2 + 2 x 2 + 2 y 2 + 2 z 2 = 2 t 2 + 2 2 = 2 t 2 + 2 x 2 + 2 y 2 + 2 z 2 = 2 t 2 + 2 {:[del^(2)=-(del^(2))/(delt^(2))+(del^(2))/(delx^(2))+(del^(2))/(dely^(2))+(del^(2))/(delz^(2))],[=-(del^(2))/(delt^(2))+ vec(grad)^(2)]:}\begin{aligned} \partial^{2} & =-\frac{\partial^{2}}{\partial t^{2}}+\frac{\partial^{2}}{\partial x^{2}}+\frac{\partial^{2}}{\partial y^{2}}+\frac{\partial^{2}}{\partial z^{2}} \\ & =-\frac{\partial^{2}}{\partial t^{2}}+\vec{\nabla}^{2} \end{aligned}2=2t2+2x2+2y2+2z2=2t2+2
12 12 ^(12){ }^{12}12 Reminder: A wave equation has the form 2 ϕ = 2 ϕ t 2 + 2 ϕ = 0 2 ϕ = 2 ϕ t 2 + 2 ϕ = 0 del^(2)phi=-(del^(2)phi)/(delt^(2))+ vec(grad)^(2)phi=0\partial^{2} \phi=-\frac{\partial^{2} \phi}{\partial t^{2}}+\vec{\nabla}^{2} \phi=02ϕ=2ϕt2+2ϕ=0. Its solutions are plane waves of the form ϕ = A e i ( ω t k x ) ϕ = A e i ( ω t k x ) phi=Ae^(-i(omega t- vec(k)* vec(x)))\phi=A \mathrm{e}^{-\mathrm{i}(\omega t-\vec{k} \cdot \vec{x})}ϕ=Aei(ωtkx).
The analogous expression in electromagnetism is 2 A μ = J μ 2 A μ = J μ -del^(2)A^(mu)=J^(mu)-\partial^{2} A^{\mu}=J^{\mu}2Aμ=Jμ. The (retarded) potential in electromagnetism that solves this latter wave equation is
(45.30) A μ ( t , x ) = 1 4 π d 3 y J μ ( t | x y | , y ) | x y | . (45.30) A μ ( t , x ) = 1 4 π d 3 y J μ ( t | x y | , y ) | x y | . {:(45.30)A^(mu)(t"," vec(x))=(1)/(4pi)intd^(3)y(J^(mu)(t-|( vec(x))-( vec(y))|,( vec(y))))/(|( vec(x))-( vec(y))|).:}\begin{equation*} A^{\mu}(t, \vec{x})=\frac{1}{4 \pi} \int \mathrm{~d}^{3} y \frac{J^{\mu}(t-|\vec{x}-\vec{y}|, \vec{y})}{|\vec{x}-\vec{y}|} . \tag{45.30} \end{equation*}(45.30)Aμ(t,x)=14π d3yJμ(t|xy|,y)|xy|.
This suggests that the solution to the weak-field gravitation equation should be given by the retarded function
(45.31) h ¯ μ ν ( t , x ) = 4 G d 3 y T μ ν ( t | x y | , y ) | x y | , (45.31) h ¯ μ ν ( t , x ) = 4 G d 3 y T μ ν ( t | x y | , y ) | x y | , {:(45.31) bar(h)_(mu nu)(t"," vec(x))=4G intd^(3)y(T_(mu nu)(t-|( vec(x))-( vec(y))|,( vec(y))))/(|( vec(x))-( vec(y))|)",":}\begin{equation*} \bar{h}_{\mu \nu}(t, \vec{x})=4 G \int \mathrm{~d}^{3} y \frac{T_{\mu \nu}(t-|\vec{x}-\vec{y}|, \vec{y})}{|\vec{x}-\vec{y}|}, \tag{45.31} \end{equation*}(45.31)h¯μν(t,x)=4G d3yTμν(t|xy|,y)|xy|,
where the factor of G G GGG has been restored. We shall examine this further in the next two chapters.

Example 45.4

Let's check out eqn 45.31 in the limit of a non-relativistic stationary source (i.e. a static distribution of mass). This problem has no time dependence and so eqn 45.31 then reduces to
(45.32) h ¯ μ ν ( x ) = 4 G d 3 y T μ ν ( y ) | x y | (45.32) h ¯ μ ν ( x ) = 4 G d 3 y T μ ν ( y ) | x y | {:(45.32) bar(h)_(mu nu)( vec(x))=4G intd^(3)y(T_(mu nu)(( vec(y))))/(|( vec(x))-( vec(y))|):}\begin{equation*} \bar{h}_{\mu \nu}(\vec{x})=4 G \int \mathrm{~d}^{3} y \frac{T_{\mu \nu}(\vec{y})}{|\vec{x}-\vec{y}|} \tag{45.32} \end{equation*}(45.32)h¯μν(x)=4G d3yTμν(y)|xy|
and in the weak-field limit the only significant term in T μ ν ( y ) T μ ν ( y ) T_(mu nu)( vec(y))T_{\mu \nu}(\vec{y})Tμν(y) is T 00 ( y ) ρ ( y ) T 00 ( y ) ρ ( y ) T_(00)( vec(y))~~rho( vec(y))T_{00}(\vec{y}) \approx \rho(\vec{y})T00(y)ρ(y), where ρ ( y ) ρ ( y ) rho( vec(y))\rho(\vec{y})ρ(y) is the density distribution of the source. Hence, we only have to worry about
(45.33) h ¯ 00 ( x ) = 4 G d 3 y ρ ( y ) | x y | . (45.33) h ¯ 00 ( x ) = 4 G d 3 y ρ ( y ) | x y | . {:(45.33) bar(h)_(00)( vec(x))=4G intd^(3)y(rho(( vec(y))))/(|( vec(x))-( vec(y))|).:}\begin{equation*} \bar{h}_{00}(\vec{x})=4 G \int \mathrm{~d}^{3} y \frac{\rho(\vec{y})}{|\vec{x}-\vec{y}|} . \tag{45.33} \end{equation*}(45.33)h¯00(x)=4G d3yρ(y)|xy|.
However, this equation reminds us of the equation for the gravitational scalar potential Φ ( x ) Φ ( x ) Phi( vec(x))\Phi(\vec{x})Φ(x) which is
(45.34) Φ ( x ) = G d 3 y ρ ( y ) | x y | (45.34) Φ ( x ) = G d 3 y ρ ( y ) | x y | {:(45.34)Phi( vec(x))=-G intd^(3)y(rho(( vec(y))))/(|( vec(x))-( vec(y))|):}\begin{equation*} \Phi(\vec{x})=-G \int \mathrm{~d}^{3} y \frac{\rho(\vec{y})}{|\vec{x}-\vec{y}|} \tag{45.34} \end{equation*}(45.34)Φ(x)=G d3yρ(y)|xy|
from which we immediately deduce that the only non-negligible term in h ¯ μ ν h ¯ μ ν bar(h)_(mu nu)\bar{h}_{\mu \nu}h¯μν is
(45.35) h ¯ 00 ( x ) = 4 Φ ( x ) . (45.35) h ¯ 00 ( x ) = 4 Φ ( x ) . {:(45.35) bar(h)_(00)( vec(x))=-4Phi( vec(x)).:}\begin{equation*} \bar{h}_{00}(\vec{x})=-4 \Phi(\vec{x}) . \tag{45.35} \end{equation*}(45.35)h¯00(x)=4Φ(x).
This is exactly the same conclusion we came to in Example 14.2, and following the same argument we can quickly deduce that
(45.36) h 00 = h 11 = h 22 = h 33 = 2 Φ (45.36) h 00 = h 11 = h 22 = h 33 = 2 Φ {:(45.36)h_(00)=h_(11)=h_(22)=h_(33)=-2Phi:}\begin{equation*} h_{00}=h_{11}=h_{22}=h_{33}=-2 \Phi \tag{45.36} \end{equation*}(45.36)h00=h11=h22=h33=2Φ
so that the metric can be written as
(45.37) d s 2 = ( 1 + 2 Φ ) d t 2 + ( 1 2 Φ ) ( d x 2 + d y 2 + d z 2 ) (45.37) d s 2 = ( 1 + 2 Φ ) d t 2 + ( 1 2 Φ ) d x 2 + d y 2 + d z 2 {:(45.37)ds^(2)=-(1+2Phi)dt^(2)+(1-2Phi)(dx^(2)+dy^(2)+dz^(2)):}\begin{equation*} \mathrm{d} s^{2}=-(1+2 \Phi) \mathrm{d} t^{2}+(1-2 \Phi)\left(\mathrm{d} x^{2}+\mathrm{d} y^{2}+\mathrm{d} z^{2}\right) \tag{45.37} \end{equation*}(45.37)ds2=(1+2Φ)dt2+(12Φ)(dx2+dy2+dz2)
which is, of course, the weak-field metric (eqn 5.22, derived in Section 14.1).

Example 45.5

The energy-momentum tensor for dust takes the form T μ ν = ρ u μ u ν T μ ν = ρ u μ u ν T^(mu nu)=rhou^(mu)u^(nu)T^{\mu \nu}=\rho u^{\mu} u^{\nu}Tμν=ρuμuν, and so we only keep the dominant term T 00 = ρ T 00 = ρ T^(00)=rhoT^{00}=\rhoT00=ρ in the weak-field limit. The next-most important
13 13 ^(13){ }^{13}13 Recall that, inserting the factors of c c ccc, that T 00 = ρ c 2 T 00 = ρ c 2 T^(00)=rhoc^(2)T^{00}=\rho c^{2}T00=ρc2 and T 0 i = ρ c u i T 0 i = ρ c u i T^(0i)=rho cu^(i)T^{0 i}=\rho c u^{i}T0i=ρcui. terms are T 0 i = ρ u i T 0 i = ρ u i T^(0i)=rhou^(i)T^{0 i}=\rho u^{i}T0i=ρui and these terms are smaller than T 00 T 00 T^(00)T^{00}T00 by a factor 13 v / c 13 v / c ^(13)v//c{ }^{13} v / c13v/c. The terms T i j = ρ u i u j T i j = ρ u i u j T^(ij)=rhou^(i)u^(j)T^{i j}=\rho u^{i} u^{j}Tij=ρuiuj are smaller than T 00 T 00 T^(00)T^{00}T00 by a factor of ( v / c ) 2 ( v / c ) 2 (v//c)^(2)(v / c)^{2}(v/c)2 and so can be safely ignored, but now let's consider the effect of the T 0 i T 0 i T^(0i)T^{0 i}T0i terms. We can now relate h ¯ 0 i h ¯ 0 i bar(h)_(0i)\bar{h}_{0 i}h¯0i to a gravitational analogue of the vector potential A g A g vec(A)_(g)\vec{A}_{\mathrm{g}}Ag by writing
(45.38) A g i ( x ) = G d 3 y J i ( y ) | x y | (45.38) A g i ( x ) = G d 3 y J i ( y ) | x y | {:(45.38)A_(g)^(i)( vec(x))=-G intd^(3)y(J^(i)(( vec(y))))/(|( vec(x))-( vec(y))|):}\begin{equation*} A_{\mathrm{g}}^{i}(\vec{x})=-G \int \mathrm{~d}^{3} y \frac{J^{i}(\vec{y})}{|\vec{x}-\vec{y}|} \tag{45.38} \end{equation*}(45.38)Agi(x)=G d3yJi(y)|xy|
where J i ( y ) = ρ ( y ) u i ( y ) J i ( y ) = ρ ( y ) u i ( y ) J^(i)( vec(y))=rho( vec(y))u^(i)( vec(y))J^{i}(\vec{y})=\rho(\vec{y}) u^{i}(\vec{y})Ji(y)=ρ(y)ui(y) is the current density, in which case we deduce that (on lowering indices, and using h 0 i = h ¯ 0 i h 0 i = h ¯ 0 i h_(0i)= bar(h)_(0i)h_{0 i}=\bar{h}_{0 i}h0i=h¯0i )
(45.39) h ¯ 0 i ( x ) = 4 A g i ( x ) (45.39) h ¯ 0 i ( x ) = 4 A g i ( x ) {:(45.39) bar(h)_(0i)( vec(x))=-4A_(gi)( vec(x)):}\begin{equation*} \bar{h}_{0 i}(\vec{x})=-4 A_{\mathrm{g} i}(\vec{x}) \tag{45.39} \end{equation*}(45.39)h¯0i(x)=4Agi(x)
Now, using our linearized equation of gravitation (eqn 45.29), and ignoring the smaller time-dependent term, we have
(45.40) 2 Φ = 4 π G ρ and 2 A g = 4 π G J (45.40) 2 Φ = 4 π G ρ  and  2 A g = 4 π G J {:(45.40)grad^(2)Phi=4pi G rhoquad" and "quadgrad^(2) vec(A)_(g)=4pi G vec(J):}\begin{equation*} \nabla^{2} \Phi=4 \pi G \rho \quad \text { and } \quad \nabla^{2} \vec{A}_{\mathrm{g}}=4 \pi G \vec{J} \tag{45.40} \end{equation*}(45.40)2Φ=4πGρ and 2Ag=4πGJ
which are the Poisson's equations for the gravitational scalar and vector potential. These are, of course, reminiscent of the analogous equations for electromagnetism, namely
(45.41) 2 V = ρ ϵ 0 and 2 A = μ 0 J (45.41) 2 V = ρ ϵ 0  and  2 A = μ 0 J {:(45.41)grad^(2)V=-(rho)/(epsilon_(0))quad" and "quadgrad^(2) vec(A)=-mu_(0) vec(J):}\begin{equation*} \nabla^{2} V=-\frac{\rho}{\epsilon_{0}} \quad \text { and } \quad \nabla^{2} \vec{A}=-\mu_{0} \vec{J} \tag{45.41} \end{equation*}(45.41)2V=ρϵ0 and 2A=μ0J
where the minus signs reflect the fact that like charges repel in electromagnetism but masses attract in gravity. Pursuing this analogy, we can define the gravitoelectric field E g = Φ E g = Φ vec(E)_(g)=- vec(grad)Phi\vec{E}_{\mathrm{g}}=-\vec{\nabla} \PhiEg=Φ and the gravitomagnetic field B g = × A g B g = × A g vec(B)_(g)= vec(grad)xx vec(A)_(g)\vec{B}_{\mathrm{g}}=\vec{\nabla} \times \vec{A}_{\mathrm{g}}Bg=×Ag and, by analogy with electromagnetism, these will satisfy the gravitational Maxwell equations
(45.42) E g = 4 π G ρ , B g = 0 , × E g = 0 , × B g = 4 π G J (45.42) E g = 4 π G ρ , B g = 0 , × E g = 0 , × B g = 4 π G J {:(45.42) vec(grad)* vec(E)_(g)=-4pi G rho","quad vec(grad)* vec(B)_(g)=0","quad vec(grad)xx vec(E)_(g)=0","quad vec(grad)xx vec(B)_(g)=-4pi G vec(J):}\begin{equation*} \vec{\nabla} \cdot \vec{E}_{\mathrm{g}}=-4 \pi G \rho, \quad \vec{\nabla} \cdot \vec{B}_{\mathrm{g}}=0, \quad \vec{\nabla} \times \vec{E}_{\mathrm{g}}=0, \quad \vec{\nabla} \times \vec{B}_{\mathrm{g}}=-4 \pi G \vec{J} \tag{45.42} \end{equation*}(45.42)Eg=4πGρ,Bg=0,×Eg=0,×Bg=4πGJ
These are analogous to the electromagnetic Maxwell equations ignoring any timedependence. This analogy between gravity and electromagnetism can be taken further and it can be shown that, just as a charge current in electromagnetism creates a magnetic field that exerts at v × B v × B vec(v)xx vec(B)\vec{v} \times \vec{B}v×B force, so a current of mass creates a gravitomagnetic field that exerts a v × B g v × B g vec(v)xx vec(B)_(g)\vec{v} \times \vec{B}_{\mathrm{g}}v×Bg force. This is the basis of the Lense-Thirring effect in which a spinning mass (in which you have rotational currents of mass) will drag objects around with the rotation. 14 14 ^(14){ }^{14}14
Weak-field gravitation is a very useful limit for a lot of the Universe we encounter for which gravity is a relatively small perturbation. After all, most of the Universe is pretty empty of matter. In the next chapter, we will solve eqn 45.29 in matter-free spacetime, so that the energymomentum tensor T = 0 T = 0 T=0\boldsymbol{T}=0T=0 and we then have 2 h ¯ μ ν = ( 2 t 2 2 ) h ¯ μ ν = 2 h ¯ μ ν = 2 t 2 2 h ¯ μ ν = -del^(2) bar(h)_(mu nu)=((del^(2))/(delt^(2))- vec(grad)^(2)) bar(h)_(mu nu)=-\partial^{2} \bar{h}_{\mu \nu}=\left(\frac{\partial^{2}}{\partial t^{2}}-\vec{\nabla}^{2}\right) \bar{h}_{\mu \nu}=2h¯μν=(2t22)h¯μν= 0 . This is a wave equation and predicts the existence of gravitational waves!

Chapter summary

  • Weak-field gravitation can be approximately described using a Minkowski metric with a small additive correction given by the weak-field metric h ( x ) h ( x ) h(x)\boldsymbol{h}(x)h(x).
  • In terms of the weak-field metric, the Einstein equation is
(45.43) 2 h ¯ μ ν = 16 π T μ ν (45.43) 2 h ¯ μ ν = 16 π T μ ν {:(45.43)-del^(2) bar(h)_(mu nu)=16 piT_(mu nu):}\begin{equation*} -\partial^{2} \bar{h}_{\mu \nu}=16 \pi T_{\mu \nu} \tag{45.43} \end{equation*}(45.43)2h¯μν=16πTμν
where h ¯ μ ν = h μ ν 1 2 η μ ν h h ¯ μ ν = h μ ν 1 2 η μ ν h bar(h)_(mu nu)=h_(mu nu)-(1)/(2)eta_(mu nu)h\bar{h}_{\mu \nu}=h_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} hh¯μν=hμν12ημνh. This is a wave equation in the absence of sources of mass-energy.
  • Under an infinitesimal coordinate change x μ x μ + ξ μ x μ x μ + ξ μ x^(mu^('))rarrx^(mu)+xi^(mu)x^{\mu^{\prime}} \rightarrow x^{\mu}+\xi^{\mu}xμxμ+ξμ, we have
(45.44) h μ ν h μ ν 2 ξ ( μ , ν ) . (45.44) h μ ν h μ ν 2 ξ ( μ , ν ) . {:(45.44)h_(mu nu)rarrh_(mu nu)-2xi_((mu,nu)).:}\begin{equation*} h_{\mu \nu} \rightarrow h_{\mu \nu}-2 \xi_{(\mu, \nu)} . \tag{45.44} \end{equation*}(45.44)hμνhμν2ξ(μ,ν).

Exercises

(45.1) Verify (a) eqn 45.10
(b) eqn 45.14;
(c) eqn 45.26 .
(45.2) (a) Take a trace to show that
(45.45) 2 h = 16 π T (45.45) 2 h = 16 π T {:(45.45)del^(2)h=16 pi T:}\begin{equation*} \partial^{2} h=16 \pi T \tag{45.45} \end{equation*}(45.45)2h=16πT
where h h hhh is the trace of the tensor h h h\boldsymbol{h}h.
(b) Show that the weak-field, linearized Einstein equation can be rewritten as
(45.46) 2 h μ ν = 16 π ( T μ ν 1 2 η μ ν T ) (45.46) 2 h μ ν = 16 π T μ ν 1 2 η μ ν T {:(45.46)-del^(2)h_(mu nu)=16 pi(T_(mu nu)-(1)/(2)eta_(mu nu)T):}\begin{equation*} -\partial^{2} h_{\mu \nu}=16 \pi\left(T_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} T\right) \tag{45.46} \end{equation*}(45.46)2hμν=16π(Tμν12ημνT)
(c) For a stationary source, explain why the solution to the Einstein equation
(45.47) 2 h μ ν = 16 π ( T μ ν 1 2 η μ ν T ) , (45.47) 2 h μ ν = 16 π T μ ν 1 2 η μ ν T , {:(45.47) vec(grad)^(2)h_(mu nu)=-16 pi(T_(mu nu)-(1)/(2)eta_(mu nu)T)",":}\begin{equation*} \vec{\nabla}^{2} h_{\mu \nu}=-16 \pi\left(T_{\mu \nu}-\frac{1}{2} \eta_{\mu \nu} T\right), \tag{45.47} \end{equation*}(45.47)2hμν=16π(Tμν12ημνT),
take the form
(45.48) h μ ν ( x ) = 2 d Σ ( 2 T μ ν η μ ν T ) | x x | (45.48) h μ ν ( x ) = 2 d Σ 2 T μ ν η μ ν T x x {:(45.48)h_(mu nu)( vec(x))=2intdSigma((2T_(mu nu)-eta_(mu nu)T))/(|( vec(x))- vec(x)^(')|):}\begin{equation*} h_{\mu \nu}(\vec{x})=2 \int \mathrm{~d} \Sigma \frac{\left(2 T_{\mu \nu}-\eta_{\mu \nu} T\right)}{\left|\vec{x}-\vec{x}^{\prime}\right|} \tag{45.48} \end{equation*}(45.48)hμν(x)=2 dΣ(2TμνημνT)|xx|
where d Σ d Σ dSigma\mathrm{d} \SigmadΣ is an element of 3 -volume and the energy-momentum tensor is evaluated at position x x vec(x)^(')\vec{x}^{\prime}x.
(45.3) For a low-velocity perfect fluid in the weak-field limit, show the following:
(a) The gravitoelectric energy density, defined as ρ g = 2 T t t η t t T ρ g = 2 T t t η t t T rho_(g)=2T_(tt)-eta_(tt)T\rho_{\mathrm{g}}=2 T_{t t}-\eta_{t t} Tρg=2TttηttT, is given by
(45.49) ρ g ρ + 3 p (45.49) ρ g ρ + 3 p {:(45.49)rho_(g)~~rho+3p:}\begin{equation*} \rho_{\mathrm{g}} \approx \rho+3 p \tag{45.49} \end{equation*}(45.49)ρgρ+3p
(b) The gravitometric current density, Π i = Π i = Pi_(i)=\Pi_{i}=Πi= T t i + 1 2 η t i T T t i + 1 2 η t i T -T_(ti)+(1)/(2)eta_(ti)T-T_{t i}+\frac{1}{2} \eta_{t i} TTti+12ηtiT, is given by
(45.50) Π i ( ρ + p ) u i (45.50) Π i ( ρ + p ) u i {:(45.50)Pi_(i)~~(rho+p)u_(i):}\begin{equation*} \Pi_{i} \approx(\rho+p) u_{i} \tag{45.50} \end{equation*}(45.50)Πi(ρ+p)ui
where u i u i u_(i)u_{i}ui are velocity 1 -form components.
(c) The curvature energy density, ρ c = 2 T i i ρ c = 2 T i i rho_(c)=2T_(ii)-\rho_{c}=2 T_{i i}-ρc=2Tii η i i T η i i T eta_(ii)T\eta_{i i} TηiiT, is given by
(45.51) ρ c ρ p (45.51) ρ c ρ p {:(45.51)rho_(c)~~rho-p:}\begin{equation*} \rho_{c} \approx \rho-p \tag{45.51} \end{equation*}(45.51)ρcρp
This curvature energy density is the quantity in eqn 45.48 that causes the diagonal, spatial parts of h h h\boldsymbol{h}h to be non-zero. These are the parts of g g g\boldsymbol{g}g that give the deviations from a flat-spacetime metric.
(45.4) (a) Show that a particle moving in the weak, stationary field obeys the geodesic equation
(45.52) d 2 x i d t 2 η i j ( j Φ + F j k v k ) (45.52) d 2 x i d t 2 η i j j Φ + F j k v k {:(45.52)(d^(2)x^(i))/((d)t^(2))~~eta^(ij)(-del_(j)Phi+F_(jk)v^(k)):}\begin{equation*} \frac{\mathrm{d}^{2} x^{i}}{\mathrm{~d} t^{2}} \approx \eta^{i j}\left(-\partial_{j} \Phi+F_{j k} v^{k}\right) \tag{45.52} \end{equation*}(45.52)d2xi dt2ηij(jΦ+Fjkvk)
where v k = d x k / d t v k = d x k / d t v^(k)=dx^(k)//dtv^{k}=\mathrm{d} x^{k} / \mathrm{d} tvk=dxk/dt and
Φ = h t t / 2 (45.53) F i j = i h t j j h t i Φ = h t t / 2 (45.53) F i j = i h t j j h t i {:[Phi=-h_(tt)//2],[(45.53)F_(ij)=del_(i)h_(tj)-del_(j)h_(ti)]:}\begin{align*} \Phi & =-h_{t t} / 2 \\ F_{i j} & =\partial_{i} h_{t j}-\partial_{j} h_{t i} \tag{45.53} \end{align*}Φ=htt/2(45.53)Fij=ihtjjhti
(b) Show that the source of Φ Φ Phi\PhiΦ is given by evaluating
(45.54) Φ ( x ) = d 3 x ρ g ( x ) | x x | (45.54) Φ ( x ) = d 3 x ρ g x x x {:(45.54)Phi( vec(x))=-intd^(3)x^(')(rho_(g)( vec(x)^(')))/(|( vec(x))- vec(x)^(')|):}\begin{equation*} \Phi(\vec{x})=-\int \mathrm{d}^{3} x^{\prime} \frac{\rho_{\mathrm{g}}\left(\vec{x}^{\prime}\right)}{\left|\vec{x}-\vec{x}^{\prime}\right|} \tag{45.54} \end{equation*}(45.54)Φ(x)=d3xρg(x)|xx|
where ρ g ρ g rho_(g)\rho_{\mathrm{g}}ρg was defined in the previous question.
(45.5) (a) Show that in a Universe filled with only vacuum energy, the gravitoelectric energy density is negative, making gravity repulsive.
(b) Show further that in terms of the potential Φ Φ Phi\PhiΦ, we obtain an equation 2 Φ = Λ 2 Φ = Λ vec(grad)^(2)Phi=-Lambda\vec{\nabla}^{2} \Phi=-\Lambda2Φ=Λ, which is solved by a potential Φ = α r 2 Φ = α r 2 Phi=alphar^(2)\Phi=\alpha r^{2}Φ=αr2, where α α alpha\alphaα is a constant that should be determined.
(45.6) We shall demonstrate the precession of angular momentum in a gravitational field. We follow the approach of Ryder (2009).
In the weak-field limit, the geometry outside a rotating object leads to a metric with components
( ( 1 + 2 ϕ ) ζ 1 ζ 2 ζ 3 ζ 1 1 2 ϕ 0 0 ζ 2 0 1 2 ϕ 0 ζ 3 0 0 1 2 ϕ ) ( 1 + 2 ϕ ) ζ 1 ζ 2 ζ 3 ζ 1 1 2 ϕ 0 0 ζ 2 0 1 2 ϕ 0 ζ 3 0 0 1 2 ϕ ([-(1+2phi),zeta_(1),zeta_(2),zeta_(3)],[zeta_(1),1-2phi,0,0],[zeta_(2),0,1-2phi,0],[zeta_(3),0,0,1-2phi])\left(\begin{array}{cccc} -(1+2 \phi) & \zeta_{1} & \zeta_{2} & \zeta_{3} \\ \zeta_{1} & 1-2 \phi & 0 & 0 \\ \zeta_{2} & 0 & 1-2 \phi & 0 \\ \zeta_{3} & 0 & 0 & 1-2 \phi \end{array}\right)((1+2ϕ)ζ1ζ2ζ3ζ112ϕ00ζ2012ϕ0ζ30012ϕ)
where
ϕ = M r (45.56) ζ i = 2 r 3 ε i j k x j J k ϕ = M r (45.56) ζ i = 2 r 3 ε i j k x j J k {:[phi=-(M)/(r)],[(45.56)zeta_(i)=(2)/(r^(3))*epsi_(ijk)x^(j)J^(k)]:}\begin{align*} \phi & =-\frac{M}{r} \\ \zeta_{i} & =\frac{2}{r^{3}} \cdot \varepsilon_{i j k} x^{j} J^{k} \tag{45.56} \end{align*}ϕ=Mr(45.56)ζi=2r3εijkxjJk
where J i J i J^(i)J^{i}Ji are components of the angular momentum of the source, which has mass M M MMM.
(a) Assuming that, for application to the Earth, the quantities ϕ 2 ϕ 2 phi^(2)\phi^{2}ϕ2 and ϕ ζ i ϕ ζ i phizeta_(i)\phi \zeta_{i}ϕζi and ζ i ζ j ζ i ζ j zeta_(i)zeta_(j)\zeta_{i} \zeta_{j}ζiζj can be ignored, show that the components of g μ ν g μ ν g^(mu nu)g^{\mu \nu}gμν are the same as
those of g μ ν g μ ν g_(mu nu)g_{\mu \nu}gμν with the sign in front of ϕ ϕ phi\phiϕ reversed.
(b) Show that, to leading order, the connection coefficients are
Γ 0 i 0 = i ϕ , Γ k i 0 = 1 2 ( ζ k , i ζ i , k ) , Γ k i m = δ i m η l k l ϕ δ i k m ϕ δ m k i ϕ , (45.57) Γ 0 i m = 1 2 ( ζ i , m + ζ m , i ) . Γ 0 i 0 = i ϕ , Γ k i 0 = 1 2 ζ k , i ζ i , k , Γ k i m = δ i m η l k l ϕ δ i k m ϕ δ m k i ϕ , (45.57) Γ 0 i m = 1 2 ζ i , m + ζ m , i . {:[Gamma^(0)_(i0)=grad_(i)phi","],[Gamma^(k)_(i0)=(1)/(2)(zeta_(k,i)-zeta_(i,k))","],[Gamma^(k)_(im)=delta_(im)eta^(lk)grad_(l)phi-delta_(i)^(k)grad_(m)phi-delta_(m)^(k)grad_(i)phi","],[(45.57)Gamma^(0)_(im)=-(1)/(2)(zeta_(i,m)+zeta_(m,i)).]:}\begin{align*} \Gamma^{0}{ }_{i 0} & =\nabla_{i} \phi, \\ \Gamma^{k}{ }_{i 0} & =\frac{1}{2}\left(\zeta_{k, i}-\zeta_{i, k}\right), \\ \Gamma^{k}{ }_{i m} & =\delta_{i m} \eta^{l k} \nabla_{l} \phi-\delta_{i}^{k} \nabla_{m} \phi-\delta_{m}^{k} \nabla_{i} \phi, \\ \Gamma^{0}{ }_{i m} & =-\frac{1}{2}\left(\zeta_{i, m}+\zeta_{m, i}\right) . \tag{45.57} \end{align*}Γ0i0=iϕ,Γki0=12(ζk,iζi,k),Γkim=δimηlklϕδikmϕδmkiϕ,(45.57)Γ0im=12(ζi,m+ζm,i).
(45.7) Define a ( 3 + 1 ) ( 3 + 1 ) (3+1)(3+1)(3+1)-dimensional spin by a 1 -form with components S μ S μ S_(mu)S_{\mu}Sμ which is orthogonal to the velocity.
(a) Show that S 0 = ( d x j / d t ) S i S 0 = d x j / d t S i S_(0)=-(dx^(j)//dt)S_(i)S_{0}=-\left(\mathrm{d} x^{j} / \mathrm{d} t\right) S_{i}S0=(dxj/dt)Si.
(b) If such a spin is parallel transported along a geodesic, show further that
d S i d t = ( Γ 0 0 i d x k d t Γ 0 j i d x j d t d x k d t (45.58) + Γ k 0 i + Γ k j i d x j d t ) S k d S i d t = Γ 0 0 i d x k d t Γ 0 j i d x j d t d x k d t (45.58) + Γ k 0 i + Γ k j i d x j d t S k {:[(dS_(i))/((d)t)=(-Gamma^(0)_(0i)((d)x^(k))/((d)t)-Gamma^(0)_(ji)((d)x^(j))/((d)t)*((d)x^(k))/((d)t):}],[(45.58){:+Gamma^(k)_(0i)+Gamma^(k)_(ji)((d)x^(j))/((d)t)*)S_(k)]:}\begin{align*} \frac{\mathrm{d} S_{i}}{\mathrm{~d} t}= & \left(-\Gamma^{0}{ }_{0 i} \frac{\mathrm{~d} x^{k}}{\mathrm{~d} t}-\Gamma^{0}{ }_{j i} \frac{\mathrm{~d} x^{j}}{\mathrm{~d} t} \cdot \frac{\mathrm{~d} x^{k}}{\mathrm{~d} t}\right. \\ & \left.+\Gamma^{k}{ }_{0 i}+\Gamma^{k}{ }_{j i} \frac{\mathrm{~d} x^{j}}{\mathrm{~d} t} \cdot\right) S_{k} \tag{45.58} \end{align*}dSi dt=(Γ00i dxk dtΓ0ji dxj dt dxk dt(45.58)+Γk0i+Γkji dxj dt)Sk
(c) Using the result of Exercise 45.6, show that the evolution of the spin in orbit around the Earth follows the equation of motion
d S i d t = [ ( i ϕ ) v k + 1 2 ( ζ k , i ζ i , k ) + ( δ i m η l k l ϕ (45.59) δ k i m ϕ δ k m ϕ ) v m ] S k . d S i d t = i ϕ v k + 1 2 ζ k , i ζ i , k + δ i m η l k l ϕ (45.59) δ k i m ϕ δ k m ϕ v m S k . {:[(dS_(i))/((d)t)=[-(grad_(i)phi)v^(k)+(1)/(2)(zeta_(k,i)-zeta_(i,k))+(delta_(im)eta^(lk)grad_(l)phi:}],[(45.59){:-delta_(ki)grad_(m)phi-delta_(km)grad phi)v^(m)]S_(k).]:}\begin{align*} \frac{\mathrm{d} S_{i}}{\mathrm{~d} t}= & {\left[-\left(\nabla_{i} \phi\right) v^{k}+\frac{1}{2}\left(\zeta_{k, i}-\zeta_{i, k}\right)+\left(\delta_{i m} \eta^{l k} \nabla_{l} \phi\right.\right.} \\ & \left.\left.-\delta_{k i} \nabla_{m} \phi-\delta_{k m} \nabla \phi\right) v^{m}\right] S_{k} . \tag{45.59} \end{align*}dSi dt=[(iϕ)vk+12(ζk,iζi,k)+(δimηlklϕ(45.59)δkimϕδkmϕ)vm]Sk.
(45.8) Using the metric in Exercise 45.6, show that to order v 2 S 2 v 2 S 2 v^(2) vec(S)^(2)v^{2} \vec{S}^{2}v2S2 or ϕ S 2 ϕ S 2 phi vec(S)^(2)\phi \vec{S}^{2}ϕS2, the quantity
(45.60) S 2 + 2 ϕ S 2 ( v S ) 2 (45.60) S 2 + 2 ϕ S 2 ( v S ) 2 {:(45.60) vec(S)^(2)+2phi vec(S)^(2)-( vec(v)* vec(S))^(2):}\begin{equation*} \vec{S}^{2}+2 \phi \vec{S}^{2}-(\vec{v} \cdot \vec{S})^{2} \tag{45.60} \end{equation*}(45.60)S2+2ϕS2(vS)2
is a constant when parallel transported.
(45.9) To see the consequences of Exercise 45.7, we define a slightly updated version of the spin vector through the 3 -vector equation
(45.61) S = ( 1 ϕ ) Σ + 1 2 ( v Σ ) v (45.61) S = ( 1 ϕ ) Σ + 1 2 ( v Σ ) v {:(45.61) vec(S)=(1-phi) vec(Sigma)+(1)/(2)( vec(v)* vec(Sigma)) vec(v):}\begin{equation*} \vec{S}=(1-\phi) \vec{\Sigma}+\frac{1}{2}(\vec{v} \cdot \vec{\Sigma}) \vec{v} \tag{45.61} \end{equation*}(45.61)S=(1ϕ)Σ+12(vΣ)v
Working to order v 2 S 2 v 2 S 2 vec(v)^(2) vec(S)^(2)\vec{v}^{2} \vec{S}^{2}v2S2 and ϕ S 2 ϕ S 2 phi vec(S)^(2)\phi \vec{S}^{2}ϕS2, and using the fact that the spin 4 -vector is parallel transported, show that
(a) ϕ S 2 = ϕ Σ 2 ϕ S 2 = ϕ Σ 2 phi vec(S)^(2)=phi vec(Sigma)^(2)\phi \vec{S}^{2}=\phi \vec{\Sigma}^{2}ϕS2=ϕΣ2.
(b) ( v S ) 2 = ( v Σ ) 2 ( v S ) 2 = ( v Σ ) 2 ( vec(v)* vec(S))^(2)=( vec(v)* vec(Sigma))^(2)(\vec{v} \cdot \vec{S})^{2}=(\vec{v} \cdot \vec{\Sigma})^{2}(vS)2=(vΣ)2,
and hence that
(c) Σ 2 = Σ 2 = vec(Sigma)^(2)=\vec{\Sigma}^{2}=Σ2= const.
This new 3-vector has a constant length and so is suitable for demonstrating precession.
(d) Explain why
(45.62) d ϕ d t = ϕ v (45.62) d ϕ d t = ϕ v {:(45.62)(dphi)/((d)t)= vec(grad)phi* vec(v):}\begin{equation*} \frac{\mathrm{d} \phi}{\mathrm{~d} t}=\vec{\nabla} \phi \cdot \vec{v} \tag{45.62} \end{equation*}(45.62)dϕ dt=ϕv
and d v / d t = ϕ d v / d t = ϕ d vec(v)//dt=- vec(grad)phi\mathrm{d} \vec{v} / \mathrm{d} t=-\vec{\nabla} \phidv/dt=ϕ.
At the level of approximation we're using, we have
(45.63) Σ = ( 1 + ϕ ) S 1 2 ( v S ) v (45.63) Σ = ( 1 + ϕ ) S 1 2 ( v S ) v {:(45.63) vec(Sigma)=(1+phi) vec(S)-(1)/(2)( vec(v)* vec(S)) vec(v):}\begin{equation*} \vec{\Sigma}=(1+\phi) \vec{S}-\frac{1}{2}(\vec{v} \cdot \vec{S}) \vec{v} \tag{45.63} \end{equation*}(45.63)Σ=(1+ϕ)S12(vS)v
(e) Use the results from part (d) to show that, if we ignore terms of order v 2 d S / d t v 2 d S / d t v^(2)d vec(S)//dtv^{2} \mathrm{~d} \vec{S} / \mathrm{d} tv2 dS/dt or ϕ d S / d t ϕ d S / d t phid vec(S)//dt\phi \mathrm{d} \vec{S} / \mathrm{d} tϕdS/dt, we have
d Σ d t = d S d t + ( ϕ v ) S + 1 2 ( v S ) ϕ + 1 2 ( ϕ S ) v d Σ d t = d S d t + ( ϕ v ) S + 1 2 ( v S ) ϕ + 1 2 ( ϕ S ) v (d( vec(Sigma)))/(dt)=(d( vec(S)))/((d)t)+( vec(grad)phi* vec(v)) vec(S)+(1)/(2)( vec(v)* vec(S)) vec(grad)phi+(1)/(2)( vec(grad)phi* vec(S)) vec(v)\frac{\mathrm{d} \vec{\Sigma}}{\mathrm{d} t}=\frac{\mathrm{d} \vec{S}}{\mathrm{~d} t}+(\vec{\nabla} \phi \cdot \vec{v}) \vec{S}+\frac{1}{2}(\vec{v} \cdot \vec{S}) \vec{\nabla} \phi+\frac{1}{2}(\vec{\nabla} \phi \cdot \vec{S}) \vec{v}dΣdt=dS dt+(ϕv)S+12(vS)ϕ+12(ϕS)v. (45.64)
(f) Using the result of Exercise 45.7, show finally that
(45.65) d Σ d t = Ω × Σ (45.65) d Σ d t = Ω × Σ {:(45.65)(d( vec(Sigma)))/((d)t)= vec(Omega)xx vec(Sigma):}\begin{equation*} \frac{\mathrm{d} \vec{\Sigma}}{\mathrm{~d} t}=\vec{\Omega} \times \vec{\Sigma} \tag{45.65} \end{equation*}(45.65)dΣ dt=Ω×Σ
where
(45.66) Ω = 1 2 × ζ 3 2 v × ϕ (45.66) Ω = 1 2 × ζ 3 2 v × ϕ {:(45.66) vec(Omega)=-(1)/(2) vec(grad)xx vec(zeta)-(3)/(2) vec(v)xx vec(grad)phi:}\begin{equation*} \vec{\Omega}=-\frac{1}{2} \vec{\nabla} \times \vec{\zeta}-\frac{3}{2} \vec{v} \times \vec{\nabla} \phi \tag{45.66} \end{equation*}(45.66)Ω=12×ζ32v×ϕ
Hint: It is permissible to swap S S vec(S)\vec{S}S for Σ Σ vec(Sigma)\vec{\Sigma}Σ in the final step to obtain the quoted equation of motion. This is the equation of motion for the precession of a vector Σ Σ vec(Sigma)\vec{\Sigma}Σ around the direction of Ω Ω vec(Omega)\vec{\Omega}Ω, and implies that a gyroscope in orbit around the Earth will precess.
(g) Using ϕ = M / r ϕ = M / r phi=-M//r\phi=-M / rϕ=M/r and ζ i = ( 2 / r 3 ) ε i k m x k J m ζ i = 2 / r 3 ε i k m x k J m zeta_(i)=(2//r^(3))epsi_(ikm)x^(k)J^(m)\zeta_{i}=\left(2 / r^{3}\right) \varepsilon_{i k m} x^{k} J^{m}ζi=(2/r3)εikmxkJm, show that
(45.67) Ω = 1 r 3 [ 3 ( J r ) r r 2 J ] + 3 M 2 r 3 r × v (45.67) Ω = 1 r 3 3 ( J r ) r r 2 J + 3 M 2 r 3 r × v {:(45.67) vec(Omega)=(1)/(r^(3))[(3(( vec(J))*( vec(r)))( vec(r)))/(r^(2))-( vec(J))]+(3M)/(2r^(3))* vec(r)xx vec(v):}\begin{equation*} \vec{\Omega}=\frac{1}{r^{3}}\left[\frac{3(\vec{J} \cdot \vec{r}) \vec{r}}{r^{2}}-\vec{J}\right]+\frac{3 M}{2 r^{3}} \cdot \vec{r} \times \vec{v} \tag{45.67} \end{equation*}(45.67)Ω=1r3[3(Jr)rr2J]+3M2r3r×v
The first term in this equation depends on the angular momentum of the Earth J J vec(J)\vec{J}J and gives rise to the Lense-Thirring effect. The second term gives the de Sitter-Fokker effect, usually called geodetic motion. Geodetic motion is a precession effect caused by the motion of the gyroscope around a geodesic and implies that on completing an orbit of a non-rotating mass, a gyroscope will precess. The Lense-Thirring precession is more remarkable as it arises purely from the rotation of the Earth. Notice how the field Ω Ω vec(Omega)\vec{\Omega}Ω causing the precession has the form of an electromagnetic dipole.